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ABIOTIC STRESS RESPONSIVE POLYNUCLEOTIDES AND POLYPEPTIDES 



Cross Reference to Related Applications 

This application claims priority of U.S. Provisional Patent Application 60/300,1 12 
5 filed June 22, 2001; U.S. Provisional Patent Application 60/314,662 filed August 24, 2001; 
U.S. Provsional Patent Application 60/325,277 filed September 26, 2001; and U.S. 
Provisional Patent Application 60/332,132 filed November 21, 2001, each of which is 
incorporated by reference in its entirety for all purposes including, but not limited to, all text 
figures, tables, sequence listings, supplemental tables, supplemental figures, appendices and 
1 0 material submitted on electronic media. 



Reference to Material Submitted on Compact Disc 

The sequence listing accompanying this application is contained on compact disc 
pursuant to PCT Rules $9bis and 89/er, and PCT Administrative Rules Part 8. The compact 
1 5 disc containing the sequence listing has been submitted in triplicate on compact discs labeled 
"Copy 1", "Copy T and "Copy 3." An additional copy labeled M CFR" has also been 
provided. Each compact disc contains a text file named 60073_Seq_Lst which is 38.8 MB in 
size and which was created on June 20, 2002. 

20 Field of the Invention 

The present invention relates to polynucleotides and polypeptides the expression of 
which is altered in response to abiotic stress and to regulatory elements that are responsive to 
abiotic stress. 



25 

Background 

Recent advances in the speed and ease of large-scale DNA sequencing and the 
computational power of bioinformatic data analysis have permitted scientists to develop 
methods for studying biological processes on a genome-wide scale, giving rise to the field 
30 known as genomics. Genomics is viewed as a powerful tool for answering fundamental and 
complex questions regarding the structure, function and evolution of biological systems. 

Structural genomics includes the sequencing and mapping of a genome, where the 
material sequenced may be genomic DNA, or the sequencing of other materials such as 
expressed sequence tags (ESTs) and development of genetic maps based on information from 
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visible and/or molecular markers, or physical maps constructed, for example, using yeast 
artificial chromosomes (YACs) and/or bacterial artificial chromosomes (BACs). Sequencing 
and mapping of the genomes of various organisms provide the tools for comparative 
genomics, which can be used to study how genes and genomes are structured and how they 
5 evolved, and may provide insight into the functions of genes and other DNA regions by 
studying their parallels in other organisms. 

Functional genomics utilizes the tools of genomics research to elucidate function, 
largely by large-scale measurements of gene expression to create expression profiles of the 
genome, and by gene mutagenesis and/or knock-out to observe the in vivo effects of altered 

10 genes. Functional genomics encompasses gene discovery, gene expression, protein and 
nucleic acid structure and function, gene and gene product interactions, and genomic 
approaches to breeding and comparative studies relevant to ecology and evolutionary 
biology. Important tools of functional genomics include: arrays, typically microarrays, of 
nucleotide sequences to simultaneously measure the expression of hundreds or thousands of 

1 5 genes; differential display, which makes it possible to analyze and compare transcribed genes 
to detect differentially expressed mRNAs; transcript profiling or cDNA-amplified fragment 
length polymorphism (cDNA-AFLP) to analyze gene expression; gene knock-out using 
classical mutagenesis or insertional mutagenesis; and bioinformatic tools to analyze 
experimental data and make predictions. 

20 Comparative functional genomics provides the opportunity to translate the knowledge 

gained from one organism, often a model system such as Arabidopsis, to another system such 
as a crop plant, by comparing genome organization, function, and evolution. Comparative 
functional genomics facilitates a greater understanding of: phylogenomics, the study of the 
evolution of genes and gene families using DNA sequence information from organisms 

25 selected at major branch points along the phylogenetic continuum; biochemical pathways, the 
orderly flow of materials and information in living organisms genomic analysis of complex 
traits, with the objective to elucidate the mechanisms of multigenic control; and host- 
environment interactions, aimed at understanding the molecular genetic basis for host 
responses to changes in environment on a genomic scale. 

30 Proteomics is a "post-genomics" set of methodologies that encompasses studies of 

protein-structure, -function, and interaction. Regulomics is. the study of gene expression at 
the level of genetic network regulatory mechanisms to identify and characterize regulatory 
elements (both cis-acting and trans-acting) that control gene expression. Furthermore, the 
associated genes, including key mediators in diseases and metabolic processes, can be 
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identified and the disease-related genetic regulatory circuits can be constructed, facilitating 
the discovery of new and better points of therapeutic intervention. Pharmacogenomics refers 
to using genomic information for the design and discovery of new drugs and new therapeutic 
approaches. 

5 Computational genomics refers both to computational processes for analyzing 

sequence and expression data from in vivo experiments and also to computational processes 
for carrying out "i« silico" experiments, including electronic hybridization (sequence 
comparison), predictions of gene structure, protein structure, protein function, direct protein- 
protein interactions, and higher-order interactions on a systems-wide basis. As such, the term 

10 "computational" can be applied widely, to denote computational comparative genomics, 
computational functional genomics, computational comparative functional genomics, 
computational comparative regulatory genomics, and the like. Computational genomics goes 
beyond the analysis of empirical information such as structure, function, and sequence, to 
develop theoretical frameworks using genomic information to create and test new proteins, to 

15 carry out protein design, to evolve genes and genomes in silico, to develop multidimensional 
phytogenies, to predict function based on context-dependent information including gene 
neighborhood and clusters of orthologous groups of proteins (COGs), and to use sequence 
data with genetic algorithms for modeling surfaces or solving complex problems. 

Integrative approaches to using biological and computational results obtained using 

20 genomics and post-genomics methodologies permit the development of systems-level tools to 
understand, model, and predict the biological functions. Combinatorial biology can be 
practiced by integrating biological information and computational information. Cellomics is 
an example of a systems-level approach to cellular or organismal function in space and over 
time. 

25 Germplasm improvement has been practiced for millennia to direct the evolution of 

plants and animals. Germplasm improvement can be directed to increasing both quantity and 
quality of an agricultural commodity, and may involve enhancing prie-existing traits or 
introducing traits that do not naturally occur in a given organism. Traditional or conventional 
breeding based on selection and crossing to introduce or enhance desired traits has been the 

30 avenue of germplasm improvement prior to the development of methods for genetic 

engineering. Genomics makes important contributions to both traditional and molecular 
methods of germplasm improvement. Genomics accelerates the discovery of genes that 
confer key traits and provides maps, markers, and other tools for enhancing traditional 
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breeding. Molecular methods of germplasm improvement utilize the products of genomics 
research in the design of genetic constructs to achieve a desired purpose. 

Plants and plant products provide the primary sustenance, either directly or indirectly, 
for all animal life, including humans. For the majority of the world's human population and 
5 for many animals, plants and plant products provide the sole source of nutrition. As the 
world population increases, the best hope to prevent widespread famine is to increase the 
quantity and improve the quality of food crops, and to make the crops available to the regions 
of the world most in need of food. 

Throughout history, a continual effort has been made to increase the yield and 

10 nutritious value of food crops. For centuries, plants having desirable characteristics such as 
greater resistance to drought conditions or increased size of fruit were crossbred and progeny 
plants exhibiting the desired characteristics were selected and used to produce seed or 
cuttings for propagation. Using such classical genetic methods, plants having, for example, 
greater disease resistance, increased yield, and better flavor have been obtained. 

1 5 A new paradigm for germplasm improvement in the cereals is based on the extensive 

similarities among the world's food cereals and other grasses in terms of chromosomal gene 
content and gene order. (Aim et aL, Mol Gen Genet 241 :483, 1993) Due to the conservation 
of gene order, or synteny, within cereal genomes, a gene on the chromosome of one grass 
species can be expected to be present in a predicted location on a specific chromosome of a 

20 number of other grass family species. (Bennetzen et al. Proc Natl Acad Sci 95:1975, 1998). 
Chromosomes of the various species, most of which differ in chromosome numbers, can be 
arrayed in concentric circles such that a radial line from the central species with the smallest 
genome will pass through regions of similar genie content in each of the other species. This 
concept has led the plant genetics community to view the grass family as a single genetic 

25 system. Recognition of these relationships has led to the prospect of gaining sufficient 

genomic information from one species to understand much of the genetics of a broad array of 
species. The identification of genes controlling important pathways such as for insect 
resistance, isolation of genes of various types, determination of directional pathways of 
evolution and location of useful genes from exotic sources, decision making on biodiversity 

30 conservation, and many other applications in plant breeding will be easier because of the 
heightened understanding of genetic relationships. Although synteny is a well-studied 
phenomenon in the cereal (grass) genomes, similar results are emerging for all groups of 
species, both plant and animal. 
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Rice is an important cereal crop for human consumption, with approximately half a 
billion tons produced annually. Rice has a genome size that is considerably smaller than the 
other major cereals, which results in a higher gene density relative to the other cereals. This 
smaller genome size and higher gene density makes rice an attractive model system for cereal 
5 gene discovery efforts and germplasm improvement through traditional breeding and 
molecular methods. Although large-scale sequencing and mapping of the rice genome is 
currently underway, there is currently no complete, assembled, and annotated genome 
available for rice. 

Microarray technology is a powerful tool that can be used to identify the presence and level 

1 0 of expression of a large number of nucleotide sequences in a single assay. A microarray is 
formed by linking a large number of discrete polynucleotide sequences, for example, a 
population of polynucleotides representative of a genome of an organism, to a solid support 
such as a microchip, glass slide, or the like, in a defined pattern. By contacting the 
microarray with a nucleic acid sample obtained from a cell of interest, and detecting those 

1 5 polynucleotides expressed in the cell can hybridize specifically to complementary sequences 
on the chip, the pattern formed by the hybridizing polynucleotides allows the identification of 
clusters of nucleotide sequences that are expressed in the cell. Furthermore, where each 
polynucleotide linked to the solid support is known, the identity of the hybridizing sequences 
from the nucleic acid sample can be identified. 

20 A strength of microarray technology is that it allows the identification of differential 

gene expression simply by comparing patterns of hybridization. For example, by comparing 
the hybridization pattern of nucleic acid molecules obtained from cells of an individual 
suffering from a disease with the nucleic acids obtained from the corresponding cells of a 
healthy individual, genes that are differentially expressed can be identified. The 

25 identification of such differentially expressed genes provides a means to identify new genes, 
and can provide insight as to the etiology of a disease. 

Microarray technology has been widely used to identify patterns of gene expression 
associated with particular stages of development or of disease conditions in animal model 
systems, and is being applied to the identification of specific patterns of gene expression in 

30 humans. The recent availability of information for the genomes of plants provides a means to 
adapt microarray technology to the study of plant gene expression^ 

The identification of plant genes involved in conferring a selective advantage on the 
plant to an environmental challenge would facilitate the generation and yield of plants, 
thereby increasing the available food supply to an increasing world population. In addition, 
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such knowledge provides a basis for diagnostic tests to identify stresses to which plants are 
subjected allowing implementation of practices to counter the stress. Thus, a need exists to 
identify plant genes and nucleotide sequences that are involved in modulating the response of 
a plant to changing environmental conditions. The present invention satisfies this need and 
5 provides additional advantages. 



Summary 

The present invention relates to clusters or groups of polynucleotides the expression of 
which is altered in response to one or more stress conditions and in particular abiotic stresses. 

1 0 Such clusters include, for example, plant polynucleotides whose expression is altered in 

response to two or more different stress conditions; and plant polynucleotides the expression of 
which is altered in response to one stress condition, but not to others. The identification of such 
clusters, using microarray technology, has allowed the identification of plant stress-regulated 
polynucleotides in rice; and homologs and orthologs thereof in other plant species and in 

1 5 particular cereals. Thus, the invention provides isolated polynucleotide sequences of stress- 
regulated nucleotide sequences from cereals, and homologs and orthologs thereof; variants of 
such sequences, and nucleotide sequences encoding substantially similar cereal stress-regulated 
polypeptides expressed there from. Such sequences include, for example, sequences encoding 
transcription factors; enzymes, including kinases; and structural proteins, including channel 

20 proteins. Accordingly, the present invention also relates to an isolated polynucleotide 

disclosed herein comprising a coding region that encodes a stress-regulated polypeptide from 
a cereal, and portions thereof. Also included is a regulatory element selected from the 
regulatory elements disclosed herein or functional portions thereof, which is involved in 
regulating the response of a cereal to a stress condition, for example, exposure to an abnormal 

25 level of salt, osmotic pressure, temperature or any combination thereof. 

The present invention also relates to a recombinant polynucleotide, which comprises a 
cereal stress-regulated nucleotide sequence disclosed herein or functional portion thereof 
operatively linked to a heterologous nucleotide sequence. In one embodiment, the 
recombinant polypeptide comprises a cereal stress-regulated regulatory element disclosed 

30 herein operatively linked to a heterologous nucleotide sequence, which is not regulated by the 
regulatory element in a naturally occurring plant. The heterologous nucleotide sequence, 
when expressed from the regulatory element, can confer a desirable phenotype to a plant cell 
containing the recombinant polynucleotide. In another embodiment, the recombinant 
polynucleotide comprises a coding region disclosed herein, or a functional portion thereof, of 
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a plant stress-regulated polynucleotide operatively linked to a heterologous promoter. The 
heterologous promoter provides a means to express the encoded stress-regulated polypeptide 
constitutively, or in a tissue-specific or phase-specific manner. 

One aspect of the present invention provides a method for determining whether a test 
5 plant, for example a cereal, has been exposed to at least one stress condition, for example an 
abiotic stress, comprising determining polynucleotide expression in the test plant to produce 
an expression profile and comparing the expression profile of the test plant to the expression 
profile of at least one reference plant that has been exposed to at least one stress, for example, 
an abiotic stress. In one embodiment the expressed polynucleotides are selected from the 

10 group consisting of any of the polynucleotide sequences contained in the sequence listing. In 
another embodiment, the test and reference plants are rice plants and the expressed 
polynucleotides are selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 
8445-8829, 17505-17506 or a functional portion thereof; the group consisting of SEQ ED 
NOs. 2159-3176; the group consisting of SEQ ID NOs. 1964-2158; the group consisting of 

15 SEQ ID NOs. 855-1963; or the group consisting of SEQ ED NOs. 1-854. 

Another aspect provides an isolated nucleic acid sequence comprising a plant 
nucleotide sequence, of at least 10 nucleotides long, that hybridizes under stringent 
conditions, or high stringency conditions, to the complement of any one of SEQ ID NOs. 1- 
4131, 8263-8353, 8445-8829, 17505-17506, or a functional portion thereof, which is 

20 operably linked to a regulatory element or functional portion thereof. In one embodiment, 
the regulatory element or functional portion thereof alters transcription of an operatively 
linked nucleic acid sequence in response to an abiotic stress. In one embodiment, the 
regulatory element, or functional portion thereof, is selected from the group consisting of 
SEQ ID NOs. 13374-17504 or a functional portion thereof. 

25 Also provided are expression cassettes, plants and seeds comprising any of the above 

isolated sequences. 

Another aspect provides an isolated polynucleotide comprising a plant, for example, a 
cereal, nucleotide sequence containing a coding region for an abiotic stress responsive 
polypeptide, selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445- 
30 8829, 17505-17506, or a functional portion thereof; or a sequence that hybridizes under 

stringent conditions or highly stringent conditions, to the complement of.any one of SEQ ID 
NOs. 1-4131, 8263-8353, 8445-8829, 17505-17506, or a functional portion thereof In one 
embodiment, the sequence is selected from the group consisting of SEQ ID NOs. 1-854. In 
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another embodiment, the sequence is selected from the group consisting of SEQ ID NOs. 
855-4131, 8445-8829 and 17505-17506. 

A further aspect provides an isolated polynucleotide comprising a rice nucleotide 
sequence containing a coding region for an abiotic stress responsive polypeptide, selected 
5 from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505- 
1 7506, or a functional protion thereof; or a sequence that hybridizes under stringent 
conditions or highly stringent conditions, to the complement of any one of SEQ ID NOs. 1- 
4131, 8263-8353, 8445-8829, and 17505-17506, or a functional protion thereof. Additional 
aspects include, any of the afore disclosed nucleotide sequences, or functional portions 

10 thereof, wherein the polynucleotide is located in the drought tolerance QTL located on rice 
chromosome 3. Another embodiment provides any of the previously disclosed 
polynucleotides wherein said polynucleotide is from a genomic region syntenic with a maize 
cold tolerance QTL. A futher embodiment provides any of the previously disclosed 
polynucleotides wherein the polynucleotide is present in the drought tolerence QTL on rice 

1 5 chromosome 8. Still a futher embodiment provides any of the previously disclosed 

polynucleotides wherein the polynucleotide encodes a protein comprising a Universal Stress 
Protein A domain. 

One specific embodiment comprises an isolated nucleic acid molecule comprising a 
polynucleotide selected from the group consisting of: a) any one of the nucleotide sequences 

20 selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 
1 7505-1 7506; b) a function portion of any of the sequences of a); c) a polynucleotide that is 
substantially similar to a sequence of a) or b); d) a sequence of at least 15 nucletides that 
hybridizes under stringent conditions to a polynucleotide of a), b) or c); e) the complement of 
any sequence of a), b), c) or d); f) the reverse complement of any sequence of a), b), c) or d); g) 

25 a polynucleotide encoding any polypeptide selected from the group consisting of SEQ ID NOs. 
4132-8262, 8354-8444, and 8830-9214; and h) an allelic variant of any of the above. 

Another aspect provides an isolated polypeptide selected from the group consisting of 
SEQ ID NOs. 41 32-8262, 8354-8444, and 8830-9214 or a functional portion thereof. 
Specific embodiments include, but are not limted to, an isolated polypeptide comprising: a) 

30 any one of the amino acid sequences selected from the group consisting of SEQ ID NOs. 
4132-8262, 8354-8444, and 8830-9214; b.) a. functional portion of a); c) an amino acid 
sequence substantially similar to any sequence of a) or b); d) an amino acid sequence of a)-c) 
wherein there has been at least one conservative amino acid substitution; and e) an allelic 
variant of any of a)-d). 
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The invention further relates to a method of producing a transgenic plant, which 
comprises at least one plant cell that exhibits altered responsiveness to a stress condition. In one 
embodiment, the method can be performed by introducing a functional portion of plant stress- 
regulated nucleotide sequence into a plant cell genome, whereby the functional portion of the 
5 plant stress-regulated nucleotide sequence modulates a response of the plant cell to a stress 
condition. The functional portion of the plant stress-regulated nucleotide sequence can 
encode a stress-regulated polypeptide or functional peptide portion thereof, wherein 
expression of the stress-regulated polypeptide or functional peptide portion thereof either 
increases the stress tolerance of the transgenic plant, or decreases the stress tolerance of the 

1 0 transgenic plant. The functional portion of the plant stress-regulated nucleotide sequence 
encoding the stress-regulated polypeptide or functional peptide portion thereof can be 
operatively linked to a heterologous promoter. The functional portion of the plant stress- 
regulated nucleotide sequence also can comprise a stress-regulated regulatory element or a 
functional portion thereof, such as a minimal promoter. The stress-regulated regulatory 

1 5 element can integrate into the plant cell genome in a site-specific manner, whereupon it can 
be operatively linked to a heterologous nucleotide sequence, which can be expressed in 
response to a stress condition specific for the regulatory element; or can be a mutant 
regulatory element, which is not responsive to the stress condition, whereby upon integrating 
into the plant cell genome, the mutant regulatory element disrupts an endogenous stress- 

20 regulated regulatory element of a plant stress-regulated nucleotide sequence, thereby altering 
the responsiveness of the plant stress-regulated nucleotide sequence to the stress condition. 

One particular aspect provides a method for producing a transgenic plant comprising 
introducing into at least one plant cell a recombinant nucleic acid construct comprising i) any 
one ofSEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 17505-17506 or the complement 

25 thereof; ii) a polynucleotide substantially similar to any one of SEQ ID NOs. 1-4131, 8263- 
8353, 8445-8829 and 17505-17506 or the complement thereof; iii) any one of SEQ ID NOs. 
13374-17504 or the complement thereof; or iv) a polynucleotide substantially similar any one 
of SEQ ID NO 13374-17504 or the complement thereof. 

Further aspects include plants and uniform populations of plants made by the above 

30 methods as well as seeds and progeny from such plants. 

In another embodiment, a transgene introduced into a plant cell according to a method 
of the invention can encode a polypeptide that regulates expression from an endogenous plant 
stress-regulated nucleotide sequence. Such a polypeptide can be, for example, a 
recombinantly produced polypeptide comprising a zinc finger domain, which is specific for 
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the regulatory element, and an effector domain, which can be a repressor domain or an 
activator domain. The polynucleotide encoding the recombinant polypeptide can be 
operatively linked to and expressed from a constitutively active, inducible or tissue specific 
or phase specific regulatory element. Expression of the recombinant polypeptide from a 
5 plant stress-regulated promoter such as any of SEQ ED NOs. 1 3374- 1 7504 or a functional 
portion thereof can be particularly advantageous in that the polypeptide can be coordinaiely 
expressed with the endogenous plant stress-regulated nucleotide sequences upon exposure to 
a stress condition. The invention also provides transgenic plants produced by a method as 
disclosed, as well as to a plant cell obtained from such transgenic plant, wherein said plant 
10 cell exhibits altered responsiveness to the stress condition; a seed produced by the transgenic 
plant; and a cDNA or genomic DNA library prepared from the transgenic plant, or from a 
plant cell from said transgenic plant, wherein said plant cell exhibits altered responsiveness to 
the stress condition. 

Thus, one aspect provides an isolated nucleic acid comprising a regulatory element 

15 comprising a sequence selected from the group consisting of SEQ ID NOs. 13374-17504 or a 
functional portion thereof, wherein said regulatory element alters transcription of an 
operatively linked polynucleotide in a plant cell, for example a cereal, in response to an 
abiotic stress. An additional aspect provides an isolated nucleic acid comprising a regulatory 
element having a sequence substantially similar to a sequence selected from the group 

20 consisting of SEQ ID NOs. 13374-17504 or a functional portion thereof, wherein said 

regulatory element alters transcription of an operatively linked polynucleotide in a plant cell 
in response to an abiotic stress. 

Further aspects provide an expression cassette comprising as operatively linked 
components, a regulatory element comprising a sequence from the group consisting of SEQ 

25 ID NOs. 1 3374-1 7504 or a functional portion thereof, wherein said regulatory element alters 
transcription of an operatively linked polynucleotide in a plant cell, for example a cereal, in 
response to an abiotic stress; a coding region; and a termination sequence. Another aspect 
provides an expression cassette comprising as operatively linked components, a regulatory 
element comprising a sequence substantially similar to a sequence selected from the group 

30 consisting of SEQ ID NOs. 13374-17504 or a functional portion thereof, wherein said 

regulatory element alters transcription of an.operatively linked.polynucleotide in a plant cell, 
for example a cereal, in response to an abiotic stress; a coding region; and a termination 
sequence. Also provided are host cells and seeds comprising such expression cassettes, 
plants containing such host cells and seeds and progeny of plants containing said host cells. 
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In related aspects, the coding region of the expression cassettes comprise sequences encoding 
marker proteins and sequences involved in gene silencing such as antisense sequences, 
double stranded RNAi sequences, a triplexing agent, and sequences comprising dominant 
negative mutations. In additional related aspects, the coding regions comprise sequences 
5 encoding polypeptides that alter the response of a plant to an abiotic stress. 

The present invention also relates to a method of modulating the responsiveness of a 
plant, for example a cereal, cell to a stress condition. Such a method can be performed, for 
example, by introducing a functional portion of a stress-regulated polynucleotide described 
herein into the plant cell, thereby modulating the responsiveness of the plant cell to a stress 

10 condition. Such a method can result in the responsiveness of the plant cell being increased 
upon exposure to the stress condition, which, in turn, can result in increased or decreased 
tolerance of the plant cell to a stress condition; or can result in the responsiveness of the plant 
cell to the stress condition being decreased, which, in turn, can result in increased or 
decreased tolerance of the plant cell to a stress condition. In one embodiment, the functional 

15 portion of the plant stress-regulated polynucleotide can integrate into the genome of the plant 
cell, thereby modulating the responsiveness of the plant cell to the stress condition. In 
another embodiment, the functional portion of the plant stress-regulated nucleotide sequence 
encodes a stress-regulated polypeptide or functional peptide portion thereof, and can be 
operatively linked to a heterologous promoter. The functional portion of the plant stress- 

20 regulated polynucleotide also can contain a mutation, whereby upon integrating into the plant 
cell genome, the polynucleotide disrupts (knocks-out) an endogenous plant stress-regulated 
sequence, thereby modulating the responsiveness of the plant cell to the stress condition. 
Depending on whether the knocked-out gene encodes an adaptive or a maladaptive stress- 
regulated polypeptide, the responsiveness of the plant will be modulated accordingly. In still 

25 another embodiment, the functional portion of the plant stress-regulated polynucleotide can 
comprise a stress-regulated regulatory element, which can be operatively linked to a 
heterologous nucleotide sequence, the expression of which can modulate the responsiveness 
of the plant cell to a stress condition. Such a heterologous nucleotide sequence can encode, 
for example, a stress-inducible transcription factor such as DREB1 A. The heterologous 

30 nucleotide sequence also can encode a polynucleotide that is specific for a plant stress- 
regulated nucleotide sequence, for example, an antisense molecule, an RNAi molecule, a 
ribozyme, and a triplexing agent, any of which, upon expression in the plant cell, reduces or 
inhibits expression of a stress-regulated polypeptide encoded by the nucleotide sequence, 
thereby modulating the responsiveness of the plant cell to a stress condition, for example, an 
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abnormal level of cold, osmotic pressure, salinity or any combination thereof. Accordingly, 
the invention also relates to a plant cell, for example a cereal obtained by such a method, and 
to a plant, for example a cereal, comprising such a plant cell. 

The present invention also relates to a method of expressing a heterologous nucleotide 
5 sequence in a plant cell, for example a cereal. Such a method can be performed, for example, 
by introducing into the plant cell a plant stress-regulated regulatory element operatively 
linked to the heterologous nucleotide sequence, whereby, upon exposure of the plant cell to a 
stress condition, the heterologous nucleotide sequence is expressed in the plant cell. In one 
embodiment, the stress regulated element is any of the sequences described herein that are 

10 capable of altering transcription of an operatively linked sequence in response to an abiotic 
stress, for example, SEQ ID NOs. 13374-17504 or a functional portion thereof. The 
heterologous nucleotide sequence can encode a selectable marker, a diagnostic marker, or a 
polypeptide that confers a desirable trait upon the plant cell, for example, a polypeptide that 
improves the nutritional value, digestability or ornamental value of the plant cell, or a plant 

1 5 comprising the plant cell . 

The present invention further relates to a method of modulating the activity of a 
biological pathway in a plant cell, wherein the pathway involves a stress-regulated 
polypeptide or a non-protein regulatory molecule, for example a ribozyme, such as those 
encoded by sequences contained in any of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 

20 17505-17506. Such a method can be performed by introducing a functional portion of a plant 
stress-regulated polynucleotide into the plant cell, thereby modulating the activity of the 
biological pathway. The method can be performed with respect to a pathway involving any 
of the stress-regulated polypeptides as disclosed herein or encoded by the polynucleotides 
disclosed herein, as well as using homologs or orthologs thereof. Also included are stress- 

25 regulated polypeptides or non-protein regulatory molecules that are encoded by sequences 
that are substantially similar to SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 17505- 
17506. 

The present invention also relates to a method of identifying a polynucleotide that 
modulates a stress response in a plant cell, for example a cereal. In one embodiment the 
30 method comprises determining polynucleotide expression in a plant exposed to at least one 
abiotic stress to-produce an expression profile and identifying sequences whose expression is 
altered at least two fold compared to plants not exposed to the stress. Such an expression 
profile can be obtained, for example, by contacting an array of probes representative of a 
plant cell genome with nucleic acid molecules expressed in a plant cell exposed to the stress; 
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detecting a nucleic acid molecule that is expressed at a level different from a level of 
expression in the absence of the stress. The method can further comprise introducing the 
differentially expressed nucleic acid molecule into a plant cell; and detecting a modulated 
response of the genetically modified plant cell to a stress, thereby identifying a 
5 polynucleotide that modulates a stress response in a plant cell. In one embodiment, the 

differentially expressed nucleic acid is selected from the group consisting of SEQ ID NOs. 1- 
4131, 8263-8353, 8445-8829 and 17505-17506. The abiotic stress can be any abiotic stress 
such as exposure to an abnormal level of cold, osmotic pressure, salinity or any combination 
thereof. The contacting is under conditions that allow for specific hybridization of a nucleic 
1 0 acid molecule with a probe having sufficient complementarity, for example, under stringent 
or highly stringent hybridization conditions. Expression of the polynucleotide can increase 
or decrease the tolerance of the plant cell to the stress, and the nucleic acid molecule can be 
expressed at a level that is less than or greater than the level of expression in the absence of 
the stress. 

1 5 The present invention additionally relates to a method of identifying a stress condition 

to which a plant cell, for example a cereal, was exposed by comparing an expression profile 
from a test plant suspected of having been exposed to at least one stress to an expression 
profile obtained from a reference plant, preferably of the same species, which has been 
exposed to the suspected stress. Such a method can be performed, for example, by contacting 

20 nucleic acid molecules expressed in the test plant cell with an array of probes representative 
of the plant cell genome; detecting a profile of expressed nucleic acid molecules 
characteristic of a stress response, and comparing the expression pattern in the test plant to 
the expression pattern obtained from a reference plant thereby identifying the stress condition 
to which the plant cell was exposed. The contacting is under conditions that allow for 

25 specific hybridization of a nucleic acid molecule with probes having sufficient 

complementarity, for example, under stringent hybridization conditions. In one embodiment, 
the stress is an abiotic stress. The profile can be characteristic of exposure to a single stress 
condition, for example, an abnormal level of cold, osmotic pressure, or salinity, or can be 
characteristic of exposure to more than one stress condition, for example, cold, increased 

30 osmotic pressure and increased salinity. In one embodiment, the polynucleotides whose 

expression is detected are selected from the group consisting of SEQ ID NOs. 1-4131, 8263- 
8353, 8445-8829 and 17505-17506. In one embodiment, the plant is a rice plant. It is 
contemplated that in any of the above embodiments that the number of polynucleotides 
whose expression will be determined will be greater than one. Thus within the scope of the 
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invention are embodiment in which the number of different polynucleotides whose 
expression is determined for any one plant is at least 10, at least 25, at least 50, at least 100, 
at least 250, at least 500, and at least 750. 

The present invention further relates to a transgenic plant, for example a cereal, which 
5 contains a nucleic acid construct comprising a function portion of any of the plant stress- 
regulated polynucleotides of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 17505-17506. 
In one embodiment, the transgenic plant exhibits altered responsiveness to a stress condition as 
compared to a corresponding reference plant not containing the construct. Such a transgenic 
plant can contain, for example, a construct that disrupts an endogenous stress-regulated 

1 0 nucleotide sequence in the plant, thereby reducing or inhibiting expression of the gene in 

response to a stress condition. Such a knock-out can increase or decrease tolerance of the plant 
to a stress condition. The transgene also can comprise a coding sequence of a plant stress- 
regulated nucleotide sequence, such as any of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 
and 17505-17506, which can be operatively linked to a heterologous regulatory element such as 

1 5 a constitutively active regulatory element, an regulated regulatory element, a tissues specific or 
phase specific regulatory element, or the like. In another embodiment, the transgenic plant 
contains a nucleic acid construct comprising a plant stress-regulated regulatory element, such as 
any of SEQ ID NOs. 13374-17504, which can be operatively linked to a heterologous nucleotide 
sequence that can encode a polypeptide or functional RNA. Expression of the heterologous 

20 polypeptide can confer a desirable characteristic on the plant, for example, can improve the 
nutritional or ornamental value of the transgenic plant. In still another embodiment, the 
transgenic plant contains multiple nucleic acid constructs, which can be multiple copies of the 
same construct, or can be two or more different constructs. 

The present invention also relates to a plant stress-regulated regulatory element, which is 

25 contained in a plant stress-responsive polynucleotide disclosed herein for example any of SEQ 
ID NOs. 13374-17504; a homolog or ortholog thereof. The invention also provides a method 
of identifying an agent, for example a transcription factor, that specifically binds to, or 
activates a plant stress-responsive regulatory element and in particular a stress responsive- 
regulatory element from any of SEQ ID NOs. 1 3374-17504. Such a method can be 

30 performed, for example, by contacting the regulatory element with a plant cell extract, and 
identifying polypeptides that specifically bind to the regulatory element. Confirmation that 
the specifically binding polypeptide is a transcription factor can be demonstrated using, for 
example, the stress-regulated regulatory element operably linked to a reporter gene, and 
detecting expression of the reporter gene. Control constructs comprising a regulatory 
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element, other than a plant stress-regulated regulatory element, operatively linked to a 
reporter molecule can be used to confirm that the transcription factor is specific for the plant 
stress-regulated regulatory element. A polynucleotide encoding such a transcription factor 
also can be obtained. 

5 The present invention also relates to a method of using a functional portion of a plant 

stress-regulated nucleotide sequence disclosed herein to confer a selective advantage on a 
plant cell, for example a cereal plant cell. In one embodiment, such a method is performed 
by introducing a plant stress-regulated regulatory element disclosed herein into a plant cell 
such as those described herein, wherein, upon exposure of the plant cell to a stress condition 

10 to which the regulatory element is responsive, a nucleotide sequence operatively linked to the 
regulatory element is expressed, thereby conferring a selective advantage to plant cell. The 
operatively linked nucleotide sequence can be, for example, a transcription factor, the 
expression of which induces the further expression of nucleotide sequences involved in a 
stress response, thereby enhancing the response of a plant to the stress condition. In another 

1 5 embodiment, a coding sequence of a plant stress-regulated sequence described herein is 

introduced into the cell, thereby providing the plant with a selective advantage in response to 
a stress condition. In still another embodiment, the method results in the knock-out of any of 
the plant stress-regulated sequences described herein in a first population of plants, thereby 
providing a selective advantage to a stress condition in a second population of plants. 

20 The invention further relates to a method of identifying an agent that modulates the 

activity of a stress-regulated regulatory element of a plant, for example a cereal. In a 
particular embodiment, is provided a method for identifying an agent that alters the activity 
of an abiotic stress responsive regulatory element comprising contacting the agent or a 
composition containing an agent to be tested with at least one abiotic stress responsive 

25 regulatory element, for example selected from the group consisting of SEQ ID NOs. 1 3374- 
17504, or a functional portion thereof, and determining the effect of the agent on the ability 
of the regulatory sequence to regulate transcription. In one embodiment, the regulatory 
element can be operatively linked to a heterologous polynucleotide encoding a reporter 
molecule, and an agent that modulates the activity of the stress-regulated regulatory element 

30 can be identified by detecting a change in expression of the reporter molecule due to 

contacting the regulatory element with the agent. Such a method can be performed in vitro in 
a plant cell-free system, or in a plant cell in culture or in a plant in situ. In another 
embodiment, the agent is contacted with a transgenic plant containing an introduced plant 
stress-regulated regulatory element, and an agent that modulates the activity of the regulatory 
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element is identified by detecting a phenotypic change in the transgenic plant. The methods 
of the invention can be performed in the presence or absence of the stress condition to which 
the particularly regulatory element is responsive. 

Another aspect provides a method for identifying an agent that alters abiotic stress 
5 responsive polynucleotide expression in a plant or plant cell, for example a cereal, 

comprising contacting a plant or plant cell with a test agent; subjecting the plant cell or plant 
cell to an abiotic stress or combination of stresses before, during or after contact with the 
agent to be tested; obtaining an expression profile of the plant or plant cell and comparing the 
expression profile of the plant or plant cell to an expression profile from a plant or plant cell 

10 not exposed to the abiotic stress or combination of stresses. In one embodiment, the 
expression profile comprises expression data for at least one sequence selected from the 
group consisting of SEQ ID NOs. M131, 8263-8353, 8445-8829, and 17505-17506. In 
additional embodiments, the plant or plant cell is a rice plant. By at least one, it should be 
realized that the above embodiments contemplate that the expression profile discussed above 

1 5 can contain expression data for greater than one of the described sequences, for example at 
least 10 sequences, at least 25 sequences, at least 50 sequences, at least 100 sequences, at 
least 250 sequences, at least 500 sequences or at least 750 sequences. 

Still another aspect provides nucleotide probes for the diagnosis of abiotic stress in 
plants, for example cereals, comprising nucleotide sequences of at least 15, 25, 50 or 100 

20 nucleotides, that hybridize under stringent or highly stringent conditions to at least one 

sequence selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, 
and 17505-17506. 

An additional aspect provides a method for selecting plants, for example cereals, 
having an altered resistance to abiotic stress comprising obtaining nucleic acid molecules 

25 from the plants to be selected; contacting the nucleic acid molecules with one or more probes 
that selectively hybridize under stringent or highly stringent conditions to a nucleic acid 
sequence selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, 
and 17505-17506; detecting the hybridization of the one or more probes to the nucleic acid 
sequences wherein the presence of the hybridization indicates the presence of a gene 

30 associated with altered resistance to abiotic stress; and selecting plants on the basis of the 
presence or absence of such hybridization. In one embodiment, marker-assisted selection is 
accomplished in rice. In another embodiment, marker assisted selection is accomplished in 
wheat using one or more probes which selectively hybridize under stringent or highly 
stringent conditions to sequences selected from the group consisting of SEQ ID NOs. 9881- 
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1 1967. In yet another embodiment, marker assisted selection is accomplished in maize or 
com using one or more probes which selectively hybridize under stringent or highly stringent 
conditions to sequences selected from the group consisting of SEQ ID NOs. 11968-13373. In 
still another embodiment, marker assisted selection is accomplished in banana using one or 
5 more probes which selectively hybridize under stringent or highly stringent conditions to 
sequences selected from the group consisting of SEQ ID NOs. 92 1 5-9880. In each case 
marker-assisted selection can be accomplished using a probe or probes to a single sequence 
or multiple sequences. If multiple sequences are used they can be used simultaneously or 
sequentially. 

10 A further aspect provides a method for monitoring a population of plants comprising 

providing at least one sentinel plant, for example a cereal, containing a recombinant 
polynucleotide comprising a stress responsive regulatory sequence selected from the group 
consisting of SEQ ID NOs. 13374-17504 which is operatively linked to a nucleotide 
sequence encoding a detectable marker, for example a fluorescent protein such as a green 

1 5 fluorescent protein, a yellow fluorescent protein, a cyan fluorescent protein, a red fluorescent 
protein or an enhanced or modified form thereof. 

A further aspect provides a computer readable medium having stored thereon 
computer executable instructions for perfoiming a method comprising receiving data on 
nucleotide sequence expression in a test plant of at least one nucleic acid molecule having at 

20 least 70%, at least 80%, at least 90% or at least 95%, sequence identity to a nucleotide 

sequence selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 
and 17505-17506; and comparing expression data from said test plant to expression data for 
the same nucleotide sequence or sequences in a plant which has been exposed to at least one 
abiotic stress. 

25 Yet a further aspect provides a computer readable medium having stored thereon a 

data structure comprising, sequence data for at least one nucleic acid molecule having at least 
70%, at least 80%, at least 90%, or at least 95%, nucleic acid sequence identity to a 
polynucleotide selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 
8445-8829 and 17505-17506, or the complement thereof; and a module receiving the nucleic 

30 acid molecule sequence data which compares the nucleic acid molecule sequence data to at 
least one other nucleic acid sequence. 

An additional aspect provides a monoclonal or polyclonal antibody to any of the 
abiotic stresss related polypeptides disclosed herein, for example, SEQ ID NOs. 4132-8261, 
8263-8353 and 8830-9214. 
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A father aspect provides a method for identifying a homolog or ortholog of an abiotic 
stress responsive polynucleotide comprising determining the nucleotide sequence of a 
plurality of isolated polynucleotides to create a set of nucleotide sequences and translating the 
nucleotide sequences in the set to derive one or more putative amino acid sequences, based 
5 on one or more of the possible reading frames of the nucleotide sequences and their 

complementary sequences. Selecting an amino acid sequence of an abiotic stress-responsive 
protein and comparing the amino acid sequence of the abiotic stress-responsive protein with 
at least one of the putative amino acid sequences. Identifying putative amino acid sequences 
having homology with at least a region of the amino acid sequence of the abiotic stress- 

10 responsive protein; and correlating putative amino acid sequences having homology to 

translated nucleotide sequences. In one embodiment the amino acid sequence of an abiotic 
stress responsive protein is selected from the group consisting of SEQ ID NOs. 4132-8262, 
8354-8444, and 8830-9214. In another embodiment, the method further comprises, 
assembling a plurality of the translated nucleotide sequences that encode the putative amino 

15 acid sequences based on regions of overlap comprising at least 10 base pairs of identical 
sequences between two translated nucleotide sequences to form one or more nucleotide 
contigs. Translating the one or more nucleotide contigs into one or more amino acid contigs. 
Comparing the one or more amino acid contigs with the amino aicd sequence of the abiotic 
stress responsive protein to determine homology between the amino acid contig and at least a 

20 region of the abiotic stress responsive protein; and identifying at least one homolog of at least 
a region of the abiotic stress responsive protein based on the homology determined. 

Detailed Description 

The following detailed description is provided to aid those skilled in the art in 
25 practicing the present invention. Even so, this detailed description should not be construed to 
unduly limit the present invention as modifications and variations in the embodiments 
discussed herein can be made by those of ordinary skill in the art without departing from the 
scope of the present invention. 

All publications, patents, patent applications, public databases, public database 
30 enteries, and other references cited in this application are herein incorporated by reference in 
their entirety as if each individual publication, patent, patent application, public database, 
public database entry, or other reference were specifically and individually indicated to be 
incorporated by reference. 
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The present invention relates to clusters or groups of polynucleotides and the 
polypeptides encoded thereby, the expression of which is altered in response to abiotic stress 
conditions and to regulatory elements that are responsive to abiotic stress. In one embodiment, 
the invention provides polynucleotides and their associated polypeptides whose expression is 
5 altered at least 2X by abiotic stress. By a two fold alteration is meant that the change in 

expression level can be described by using a multiplier or divisor of at least two. For example, 
if the expression level were set at 100 prior to stress exposure, at 2X alteration would result in an 
expression level of > 200 or < 50. Abiotic stress conditions, such as a shortage or excess of 
solar energy, water and nutrients, and salinity, high and low temperature, or pollution (e.g., 

10 heavy metals), can have a major impact on plant growth and can significantly reduce the 
yield, for example, of cultivars. Under conditions of abiotic stress, the growth of plant cells 
is inhibited by arresting the cell cycle in late Gl, before DNA synthesis, or at the G2/M 
boundary (see Dudits, Plant Cell Division, Portland Press Research, Monograph; Francis, 
Dudits, and Inze, eds., 1997; chap. 2, page 21; Bergounioux, Protoplasma 142:127-136, 

1 5 1 988). The identification of stress-regulated polynucleotide clusters, using microarray 
technology, provides a means to identify plant stress-regulated nucleotide sequences. 

The polynucleotides and polypeptides disclosed herein are expected to be functional in a 
wide variety of plants and especially cereals. The wide-spread applicability of the sequences 
disclosed is supported by the finding of expressed sequence tags from banana, wheat and maize 

20 or com having homology to the disclosed polynucleotide sequences. Table 2 shows list of such 
sequences and the SEQ ID NOs of the corresponding banana, wheat and/or maize ESTs. 

As used herein, the term "cluster," when used in reference to stress-regulated 
polynucleotides, refers to polynucleotides that have been selected by drawing Venn diagrams, 
and selecting those nucleotide sequences that are regulated only by a selected stress condition. 

25 The selected stress condition can be a single stress condition, for example, cold, osmotic stress 
or salinity stress, or can be a selected combination of stress conditions, for example, cold, 
osmotic stress and salinity stress. In addition, a cluster can be selected based on specifying that 
all of the nucleotide sequences are coordinately regulated, for example, they all start at a low 
level and are induced to a higher level. However, a cluster of saline stress-regulated nucleotide 

30 sequences, for example, that was selected for coordinate regulation from low to high, also can be 
decreased in response to cold stress or osmotic stress. By varying the parameters used for 
selecting a cluster of nucleotide sequences, those nucleotide sequences that are expressed ifi a 
specific manner following a stress can be identified. 
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As used herein in reference to a polynucleotide or nucleic acid sequence, "isolated" 
means a polynucleotide or nucleic acid sequence that is free of one or both of the nucleotide 
sequences which flank the polynucleotide in the naturally-occurring genome of the organism 
from which the polynucleotide is derived. The term includes, for example, a polynucleotide 
5 or fragment thereof that is incorporated into a vector or expression cassette; into an 
autonomously replicating plasmid or virus; into the genomic DNA of a prokaryote or 
eukaryote; or that exists as a separate molecule independent of other polynucleotides. It also 
includes a recombinant polynucleotide that is part of a hybrid polynucleotide, for example, 
one encoding a polypeptide sequence. 

10 As used herein polynucleotide" and "oligonucleotide" are used interchangeably and 

refer to a polymeric (2 or more monomers) form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides. Although nucleotides are usually joined by 
phosphodiester linkages, the term also includes polymeric nucleotides containing neutral 
amide backbone linkage composed of aminoethyl glycine units. This term refers only to the 

15 primary structure of the molecule. Thus, these terms include double- and single-stranded 
DNA and RNA as well DNA/RNA hybrids that may be single-stranded, but are more 
typically double-stranded. In addition, the term also refers to triple-stranded regions 
comprising RNA or DNA or both RNA and DNA. The. strands in such regions may be from 
the same molecule or from different molecules. The regions may include all or one or more 

20 of the molecules, but more typically involve only a region of some of the molecules. The 
terms also include known types of modifications, for example, labels, methylation, "caps", 
substitution of one or more of the naturally occurring nucleotides with an analog, 
intemucleotide modifications such as, for example, those with uncharged linkages (e.g. 
methyl phosphonates, phophotriesters, phosphoamidates, carbamates etc.), those containing 

25 pendant moieties, such as, for example, proteins (including for e.g., nucleases, toxins, 
antibodies, signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., acridine, 
psoralen, tec,), those containing alkylators, those with modified linkages (e.g. alpha anomeric 
nucleic acids, etc.), as well as unmodified forms of the polynucleotide. Polynucleotides 
include both sense and antisense, or coding and template strands. The terms include naturally 

30 occurring and chemically synthesized molecules. 

As used herein, "sequence" means the linear order in which monomers occur in a 
polymer, for example, the order of amino acids in a polypeptide or the order of nucleotides in 
a polynucleotide. 
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A "recombinant" nucleic acid is one produced by human intervention in the 
nucleotide sequence, typically selection or production. Alternatively, it can be a nucleic acid 
made by generating a sequence comprising fusion of two or more fragments which are not 
naturally contiguous to each other. Thus, for example, products made by transforming cells 
5 with any unnaturally occurring vector is encompassed, as are nucleic acids comprising 

sequences derived using any synthetic oligonucleotide process. Such is often done to replace 
a codon with a redundant codon encoding the same or a conservative amino acid, while 
typically introducing or removing a sequence recognition site. Alternatively, it is performed 
to join together nucleic acid segments of desired functions to generate a single genetic entity 
10 comprising a desired combination of functions not found in the commonly available natural 
forms. Restriction enzyme recognition sites are often the target of such artificial 
manipulations, but other site specific targets, e.g., promoters, DNA replication sites, 
regulation sequences, control sequences, or other useful features may be incorporated by 
design. 

1 5 As used herein, the term "abiotic stress" or "stress" or "stress condition" refers to the 

exposure of a plant, plant cell, or the like, to a non-living ("abiotic") physical or chemical 
agent or condition that has an adverse effect on metabolism, growth, development, 
propagation and/or survival of the plant (collectively "growth"). A stress can be imposed on 
a plant due, for example, to an environmental factor such as water (e.g., flooding, drought, 

20 dehydration), anaerobic conditions (e.g., a low level of oxygen), abnormal osmotic 

conditions, salinity or temperature (e.g., hot/heat, cold, freezing, frost), a deficiency of 
nutrients or exposure to pollutants, or by a hormone, second messenger or other molecule. 
Anaerobic stress, for example, is due to a reduction in oxygen levels (hypoxia or anoxia) 
sufficient to produce a stress response. A flooding stress can be due to prolonged or transient 

25 immersion of a plant, plant part, tissue or isolated cell in a liquid medium such as occurs 
during monsoon, wet season, flash flooding or excessive irrigation of plants, or the like. A 
cold stress or heat stress can occur due to a decrease or increase, respectively, in the 
temperature from the optimum range of growth temperatures for a particular plant species. 
Such optimum growth temperature ranges are readily determined or known to those skilled in 

30 the art. Dehydration stress can be induced by the loss of water, reduced turgor, or reduced 
_water content of a cell, tissue, organ or whole plant. Drought stress can be induced by or 
associated with the deprivation of water or reduced supply of water to a cell, tissue, organ or 
organism. Salinity-induced stress (salt-stress) can be associated with or induced by a 
perturbation in the osmotic potential of the intracellular or extracellular environment of a cell. 
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As disclosed herein, plant stress-regulated polynucleotides and clusters of stress- 
regulated polynucleotides have been identified. Surprisingly, several of the stress-regulated 
polynucleotides previously were known to encode polypeptides having defined cellular 
functions, including roles as transcription factors, enzymes such as kinases, and structural 
5 proteins such as channel proteins, but were not identified as being stress-regulated. The 
identification of rice stress-regulated nucleotide sequences has provided a means to identify 
homologous and orthologous nucleotide sequences in other plant species using procedures 
described herein. 

As used herein, the term "substantially similar", when used with respect to a 

1 0 nucleotide sequence, means a nucleotide sequence corresponding to a reference nucleotide 
sequence, wherein the corresponding sequence encodes a polypeptide having substantially 
the same structure and function as the polypeptide encoded by the reference nucleotide 
sequence, e.g., where only changes in amino acids not affecting the polypeptide function 
occur. Desirably, the substantially similar nucleotide sequence encodes the polypeptide 

1 5 encoded by the reference nucleotide sequence. The percentage of identity between the 
substantially similar nucleotide sequence and the reference nucleotide sequence is at least 
60%, at least 75%, at least 90%, at least 95%, or at least 99%, including 100%. A nucleotide 
sequence is "substantially similar" to reference nucleotide sequence that hybridizes to the 
reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP(>4, 1 mM 

20 EDTA at 50°C with washing in 2X SSC, 0.1% SDS at 50°C; in 7% sodium dodecyl sulfate 
(SDS), 0.5 M NaP04, 1 mM EDTA at 50°C with washing in IX SSC, 0.1% SDS at 50°C 
(stringent conditions); in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPC>4, 1 mM EDTA at 
50°C with washing in 0.5X SSC, 0.1% SDS at 50°C (high stringency); in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 

25 50°C (very high stringency); or in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPC>4, 1 ™M 
EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 65°C (extremely high stringency). 

As is well known in the art, stringency is related to the T m of the hybrid formed. The 
T m (melting temperature) of a nucleic acid hybrid is the temperature at which 50% of the 
bases are base-paired. For example, if one the partners in a hybrid is a short oligonucleotide 

30 of approximately 20 bases, 50% of the duplexes are typically strand separated at the T m . In 
this case, the T m reflects a time-independent equilibrium that depends on the concentration of- 
oligonucleotide. In contrast, if both strands are longer, the T m corresponds to a situation in 
which the strands are held together in structure possibly containing alternating duplex and 
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denatured regions. In this case, the T m reflects an intramolecular equilibrium that is 
independent of time and polynucleotide concentration. 

As is also well known in the art, T m is dependent on the composition of the 
polynucleotide (e.g. length, type of duplex, base composition, and extent of precise base 
5 pairing) and the composition of the solvent (e.g. salt concentration and the presence of 
denaturants such formamide). On equation for the calculation of T m can be found in 
Sambrook et al. (Molecular Cloning, 2nd ed., Cold Spring Harbor Press, 1989) and is: 
T m = 81.5EC - 16.6(logi 0 [Na + ]) = 0.41(% G + C) - 0.63(% formamide) - 600/L) 
Where JL is the length of the hybrid in base pairs, the concentration of Na + is in the range of 
10 0.01 M to 0.4M and the G + C content is in the range of 30% to 75%. Equations for hybrids 
involving RNA can be found in the same reference. Alternative equations can be found in 
Davis et al, Basic Methods in Molecular Biology, 2nd ed., Appleton and Lange, 1994, Sec 6- 
8. 

Likewise, the term "substantially similar," when used in reference to a polypeptide 

1 5 sequence, means that an amino acid sequence relative to a reference (query) sequence shares at 
least about 65% amino acid sequence identity, at least about 75% amino acid sequence identity, 
at least about 85%, at least about 90% , or at least about 95% or greater amino acid sequence 
identity. Generally, sequences having an E < 10" 8 are considered to be substantially similar to a 
query sequence. Such sequence identity can take into account conservative amino acid changes 

20 that do not substantially affect the function of a polypeptide. 

Homology or identity is often measured using sequence analysis software such as the 
Sequence Analysis Software Package of the Genetics Computer Group (University of 
Wisconsin Biotechnology Center, 17 10 University Avenue, Madison, WT 53705). Such 
software matches similar sequences by assigning degrees of homology to various deletions, 

25 substitutions and other modifications. The terms "homology" and "identity," when used herein 
in the context of two or more nucleic acids or polypeptide sequences, refer to two or more 
sequences or subsequences that are the same or have a specified percentage of amino acid 
residues or of nucleotides that are the same when compared and aligned for maximum 
correspondence over a comparison window or designated region as measured using any number 

30 of sequence comparison algorithms or by manual alignment and visual inspection. 

For sequence comparison, typically one sequence acts as a reference sequence, to which 
test sequences are compared. When using a sequence comparison algorithm, test and reference 
sequences are entered into a computer, subsequence coordinates are designated, if necessary, 
and sequence algorithm program parameters are designated. Default program parameters can be 
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used, or alternative parameters can be designated. The sequence comparison algorithm then 
calculates the percent sequence identities for the test sequences relative to the reference 
sequence, based on the program parameters. 

The term "comparison window" is used broadly herein to include reference to a segment 
5 of any one of the number of contiguous positions, for example, about 20 to 600 positions, for 
example, amino acid or nucleotide position, usually about 50 to about 200 positions, more 
usually about 100 to about 150 positions, in which a sequence may be compared to a reference 
sequence of the same number of contiguous positions after the two sequences are-optimally 
aligned. Methods of alignment of sequence for comparison are well known in the art. Optimal 

1 0 alignment of sequences for comparison can be conducted, for example, by the local homology 
algorithm of Smith and Waterman (Adv. Appl. Math. 2:482, 1981), by the homology alignment 
algorithm of Needleman and Wunsch (J. Mol. Biol 48:443, 1970), by the search for similarity 
method of Person and Lipman (Proc. Natl. Acad. ScL USA 85:2444, 1988), each of which is 
incorporated herein by reference; by computerized implementations of these algorithms (GAP, 

1 5 BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group, 575 Science Dr., Madison, WI); or by manual alignment and visual 
inspection. Other algorithms for determining homology or identity include, for example, in 
addition to a BLAST program (Basic Local Alignment Search Tool at the National Center for 
Biological Information), ALIGN, AMAS (Analysis of Multiply Aligned Sequences), AMPS 

20 (Protein Multiple Sequence Alignment), ASSET (Aligned Segment Statistical Evaluation 

Tool), BANDS, BESTSCOR, BIOSCAN (Biological Sequence Comparative Analysis Node), 
BLIMPS (BLocks LMProved Searcher), FASTA, Intervals & Points, BMB, CLUSTAL V, 
CLUSTAL W, CONSENSUS, LCONSENSUS, WCONSENSUS, Smith- Waterman 
algorithm, DARWTN, Las Vegas algorithm, FNAT (Forced Nucleotide Alignment Tool), 

25 Framealign, Framesearch, DYNAMIC, FILTER, FSAP (Fristensky Sequence Analysis 

Package), GAP (Global Alignment Program), GENAL, GIBBS, GenQuest, ISSC (Sensitive 
Sequence Comparison), LALIGN (Local Sequence Alignment), LCP (Local Content 
Program), MACAW (Multiple Alignment Construction & Analysis Workbench), MAP 
(Multiple Alignment Program), MBLKP, MBLKN, PIMA (Pattern-Induced Multi-sequence 

30 Alignment), SAGA (Sequence Alignment by Genetic Algorithm) and WHAT-IF. Such 

alignment programs can also be used to screen genome databases to identify polynucleotide 
sequences having substantially identical sequences. 

A number of genome databases are available for comparison. Several databases 
containing genomic information annotated with some functional information are maintained by 
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different organizations, and are accessible via the internet, for example, http://wwwtigr.org/tdb; 
http://www.genetics.wisc.edu; http://genome-www.stanford.edu/-ball; http://hiv-web.lanl.gov; 
http://www.ncbi.nlm.nih.gov; http://www.ebi.ac.uk; http://Pasteur.fr/othertoiology; and http:// 
www.genome.wi.mit.edu . 
5 In particular, the BLAST and BLAST 2.0 algorithms using default parameters are 

particularly useful for identifying polynucleotides and polypeptides encompassed within the 
present invention (Altschul et al. fNucleic Acids Res. 25:3389-3402, 1977; J. Mol. Biol. 
215:403-410, 1990). Software for performing BLAST analyses is publicly available through the 
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov). This algorithm 

10 involves first identifying high scoring sequence pairs (HSPs) by identifying short words of 
length W in the query sequence, which either match or satisfy some positive-valued threshold 
score T when aligned with a word of the same length in a database sequence. T is referred to as 
the neighborhood word score threshold (Altschul et al., supra, 1977, 1990). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. 

1 5 The word hits are extended in both directions along each sequence for as far as the cumulative 
alignment score can be increased. Cumulative scores are calculated using, for nucleotide 
sequences, the parameters M (reward score for a pair of matching residues; always >0). For 
amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of 
the word hits in each direction is halted when: the cumulative alignment score falls off by the 

20 quantity X from its maximum achieved value; the cumulative score goes to zero or below, due 
to the accumulation of one or more negative-scoring residue alignments; or the end of either 
sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity 
and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults 
a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N=4 and a comparison of both strands. 

25 For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, and 

expectations (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff, Proc. 
Natl. Acad. ScL USA 89:10915, 1989) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, 
and a comparison of both strands. 

The BLAST algorithm also performs a statistical analysis of the similarity between two 

30 sequences (see, for example, Karlin and Altschul, Proc. Natl. Acad. ScL USA 90:5873, 1993). 
One measure of similarity provided by BLAST algorithm is the smallest sum probability (P(N)), 
which provides an indication of the probability by which a match between two nucleotide or 
amino acid sequences would occur by chance. For example, a nucleic acid is considered similar 
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to a references sequence if the smallest sum probability in a comparison of the test nucleic acid 
to the reference nucleic acid is less than about 0.2, less than about 0.0 1 , or less than about 0.001 . 

In one embodiment, protein and nucleic acid sequence homologies are evaluated 
using the Basic Local Alignment Search Tool ("BLAST"). In particular, five specific 
5 BLAST programs are used to perform the following tasks: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein 
sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 
database; 

1 0 (3) BLASTX compares the six- frame conceptual translation products of a query 

nucleotide sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence 
database translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence 
1 5 against the six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar 
segments, which are referred to herein as "high-scoring segment pairs," between a query 
amino or nucleic acid sequence and a test sequence which may be obtained from a protein or 
nucleic acid sequence database. High-scoring segment pairs can identified (i.e., aligned) by 

20 means of a scoring matrix, many of which are known in the art. In one embodiment, the 

scoring matrix used is the BLOSUM62 matrix (Gonnet et al., Science 256:1443-1445, 1992; 
Henikoff and Henikoff, Proteins 17:49-61, 1993). In another embodiment, the PAM or 
PAM250 matrices may also be used (Schwartz and Dayhoff, eds., "Matrices for Detecting 
Distance Relationships: Atlas of Protein Sequence and Structure" (Washington, National 

25 Biomedical Research Foundation 1978)). BLAST programs are accessible through the U.S. 
National Library of Medicine, for example, at www.ncbi.nlm.nih.gov . 

The parameters used with the above algorithms may be adapted depending on the 
sequence length and degree of homology studied. In some embodiments, the parameters may be 
the default parameters used by the algorithms in the absence of instructions from the user. 

30 As disclosed herein, clusters of stress-regulated polynucleotides (and their products), 

some of which also have been described as having cellular functions such as enzymatic activity 
or roles as transcription factors, are involved in the response of plant cells and in particular plant 
cells of cereal crops to various abiotic stresses. As such, the polynucleotides in a cluster may 
share common stress-regulated regulatory elements, including, for example, cold-regulated 
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regulatory elements, salinity-regulated regulatory elements, and osmotic pressure-regulated 
regulatory elements, as well as regulatory elements that are responsive to a combination of stress 
conditions, but not to any of the individual stress conditions, alone. The identification of such 
clusters of polynucleotides thus provides a means to identify the stress-regulated regulatory 
5 elements that control the level of expression of these nucleotide sequences. 

As used herein, the term "stress-regulated polynucleotide" or "stress-responsive . 
polynucleotide" means a polynucleotide sequence of a plant the transcription of which is 
altered, in response to exposure to a stress condition and/or the regulatory elements involved 
in the response. In general, such regulatory elements will be contained within a sequence 

10 including approximately two kilobases upstream of the transcription or translation start site 
and two kilobases downstream of the transcription or translation termination site. In the 
absence of a stress condition, the stress-regulated nucleotide sequence can normally be 
unexpressed in the cells, can be expressed at a basal level, which is induced to a higher level 
in response to the stress condition, or can be expressed at a level that is reduced in response 

1 5 to the stress condition. The coding region of a plant stress-regulated nucleotide sequence 
encodes a stress-regulated or stress-responsive polypeptide or a functional non-protein 
molecule such as a ribozyme or other functional RNA. A stress-regulated polypeptide can 
have an adaptive effect on a plant, thereby allowing the plant to better tolerate stress 
conditions; or can have a maladaptive effect, thereby decreasing the ability of the plant to 

20 tolerate the stress conditions. 

"Genome" refers to the complete genetic material of an organism, specifically a plant, 
in particular, nuclear genetic material but inclusive of plastid genetic material. 

A "functional RNA" refers to an antisense RNA, ribozyme, or other RNA that is not 
translated. 

25 The present invention provides an isolated plant stress-regulated regulatory element, 

which regulates expression of an operatively linked nucleotide sequence in a plant in 
response a stress condition. A stress-responsive or stress-regulated regulatory element is one 
that alters transcription of an operatively linked polynucleotide in response to a stress, for 
example, an abiotic stress. Such alteration includes an increase in the transcription or a 

30 decrease in transcription of the operatively linked polynucleotide. As disclosed herein, a 

plant stress-regulated regulatory element can be isolated from a polynucleotide sequence of a 
plant stress-regulated nucleotide sequence as set forth in the accompanying sequence listing 
or a functional portion of said sequence. Methods for identifying and isolating the stress- 
regulated regulatory element from the disclosed polynucleotides, or genomic DNA clones 
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corresponding thereto, are well known in the art. For example, methods of making deletion 
constructs or linker-scanner constructs can be used to identify nucleotide sequences that are 
responsive to a stress condition. Generally, such constructs include a reporter gene 
operatively linked to the sequence to be examined for regulatory activity. By performing 
5 such assays, a plant stress-regulated regulatory element can be defined within a sequence of 
about 500 nucleotides or fewer, generally at least about 200 nucleotides or fewer, particularly 
about 50 to 100 nucleotides, and more particularly at least about 20 nucleotides or fewer. In 
one embodiment, the minimal (core) sequence required for regulating a stress response of a 
plant is identified. 

10 As used herein, the term "regulatory element" or "regulatory region" means a 

nucleotide sequence that, when operatively linked to a coding region, effects transcription of 
the coding region such that a ribonucleic acid (RNA) molecule is transcribed from the coding 
region. A regulatory element generally can increase or decrease the amount of transcription 
of a nucleotide sequence, for example, a coding sequence, operatively linked to the element 

1 5 with respect to the level at which the nucleotide sequence would be transcribed absent the 
regulatory element. Regulatory elements are well known in the art and include promoters, 
enhancers, silencers, inactivated silencer intron sequences, 3-untranslated or 5-untranslated 
sequences of transcribed sequence, for example, a poly-A signal sequence, or other protein or 
RNA stabilizing elements, or other gene expression control elements known to regulate gene 

20 expression or the amount of expression of a gene product. A regulatory element can be 

isolated from a naturally occurring genomic DNA sequence or can be synthetic, for example, 
a synthetic promoter. In one embodiment, the plant stress-regulated regulatory element is a 
plant stress-regulated promoter from a cereal. 

Regulatory elements can be constitutively expressed regulatory elements, which 

25 maintain gene expression at a relative level of activity (basal level), or can be regulated 

regulatory elements. Constitutively expressed regulatory elements can be expressed in any 
cell type, or can be tissue specific, which are expressed only in particular cell types, phase 
specific, which are expressed only during particular developmental or growth stages of a 
plant cell, or the like. A regulatory element such as a tissue specific or phase specific 

30 regulatory element or an inducible regulatory element useful in constructing a recombinant 
polynucleotide or in a practicing a method of the invention can be a regulatory element that 
generally, in nature, is found in a plant genome. However, the regulatory element also can be 
from an organism other than a plant, including, for example, from a plant virus, an animal 
virus, or a cell from an animal or other multicellular organism. 
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One well-known type of regulatory element useful in the practice of the present 
invention is the promoter. Useful promoters include, but are not limited to, constitutive, 
inducible, temporally regulated, developmentally regulated, spatially-regulated, chemically 
regulated, stress-responsive, tissue-specific, viral and synthetic promoters. Promoter 

5 sequences are known to be strong or weak. A strong promoter provides for a high level of 
gene expression, whereas a weak promoter provides for a very low level of gene expression. 
An inducible promoter is a promoter that provides for the turning on and off of gene 
expression in response to an exogenously added agent, or to an environmental or 
developmental stimulus. A bacterial promoter such as the P Uc promoter can be induced to 

10 varying levels of gene expression depending on the level of isothiopropylgalactoside added to 
the transformed bacterial cells. An isolated promoter sequence that is a strong promoter for 
heterologous nucleic acid is advantageous because it provides for a sufficient level of gene 
expression to allow for easy detection and selection of transformed cells and provides for a 
high level of gene expression when desired. 

1 5 Within a plant promoter region there are several domains that are necessary for full 

function of the promoter. The first of these domains lies immediately upstream of the 
structural gene and forms the "core promoter region" containing consensus sequences, 
normally 70 base pairs immediately upstream of the gene. The core promoter region contains 
the characteristic CAAT and TATA boxes plus surrounding sequences, and represents a 

20 transcription initiation sequence that defines the transcription start point for the structural 
gene. 

The presence of the core promoter region defines a sequence as being a promoter. 
The core promoter region, however, is insufficient to provide full promoter activity. A series 
of regulatory sequences upstream of the core constitute the remainder of the promoter. These 

25 regulatory sequences determine expression level, the spatial and temporal pattern of 

expression and, for an important subset of promoters, expression under inductive conditions 
(regulation by external factors such as light, temperature, chemicals, hormones). 

To define a minimal promoter region, a DNA segment representing the promoter 
region is removed from the 5' region of the gene of interest and operably linked to the coding 

30 sequence of a marker (reporter) gene by recombinant DNA techniques well known to the art. 
The reporter gene is operably linked downstream of the promoter, so that transcripts initiating 
at the promoter proceed through the reporter gene. Reporter genes generally encode proteins 
that are easily measured, including, but not limited to, chloramphenicol acetyl transferase 
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(CAT), beta-glucuronidase (GUS), green fluorescent protein (GFP), beta-galactosidase (beta- 
GAL), and luciferase. 

The construct containing the reporter gene under the control of the promoter is then 
introduced into an appropriate cell type by transfection techniques well known to the art. In 
5 one embodiment, cell lysates are prepared and appropriate assays, which are well known in 
the art, for the reporter protein are performed. For example, if CAT were the reporter gene of 
choice, the lysates from cells transfected with constructs containing CAT under the control of 
a promoter under study are mixed with isotopically labeled chloramphenicol and acetyl- 
coenzyme A (acetyl-CoA). The CAT enzyme transfers the acetyl group from acetyl-CoA to 

10 the 2- or 3-position of chloramphenicol. The reaction is monitored by thin-layer 

chromatography, which separates acetylated chloramphenicol from unreacted material. The 
reaction products are then visualized by autoradiography. 

The level of enzyme activity corresponds to the amount of enzyme that was made, 
which in turn reveals the level of expression from the promoter of interest. This level of 

1 5 expression can be compared to other promoters to determine the relative strength of the 
promoter under study. In order to be sure that the level of expression is determined by the 
promoter, rather than by the stability of the mRNA, the level of the reporter mRNA can be 
measured directly, such as by Northern blot analysis. 

Once activity is detected, mutational and/or deletionai analyses may be employed to 

20 determine the minimal region and/or sequences required to initiate transcription. Thus, 
sequences can be deleted at the 5* end of the promoter region and/or at the 3' end of the 
promoter region, and nucleotide substitutions introduced. These constructs are then 
introduced to cells and their activity determined. 

The choice of promoter will vary depending on the temporal and spatial requirements 
25 for expression, and also depending on the target species. In some cases, expression in 

multiple tissues is desirable. While in others, tissue-specific, e.g., leaf-specific, seed-specific, 
petal-specific, anther-specific, or pith-specific, expression is desirable. Although many 
promoters from dicotyledons have been shown to be operational in monocotyledons and vice 
versa, typically dicotyledonous promoters are selected for expression in dicotyledons, and 
30 monocotyledonous promoters for expression in monocotyledons. There is, however, no 
restriction to the provenance of setected'promoters. It is sufficient that the promoters are- 
operational in driving the expression of the nucleotide sequences in the desired cell. 
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A range of naturally-occurring promoters are known to be operative in plants and 
have been used to drive the expression of heterologous (both foreign and endogenous) genes 
and nucleotide sequences in plants: for example, the constitutive 35S cauliflower mosaic 
virus (CaMV) promoter, the ripening-enhanced tomato polygalacturonase promoter (Schuch 
5 et al., Plant Mol. Biol.. 13:303, 1989), the E8 promoter (Diekman & Fischer, EMBQ J., 
7:3315 1988) and the fruit specific 2A1 promoter (Pear et al., Plant Mol. Biol.. 13:639, 
1989). Many other promoters, e.g., U2 and U5 snRNA promoters from maize, the promoter 
from alcohol dehydrogenase, the Z4 promoter from a gene encoding the Z4 22 kD zein 
protein, the Z10 promoter from a gene encoding a 10 kD zein protein, a Z27 promoter from a 

10 gene encoding a 27 kD zein protein, the A20 promoter from the gene encoding a 19 kD -zein 
protein, inducible promoters, such as the light inducible promoter derived from the pea rbcS 
gene and the actin promoter from rice, e.g., the actin 2 promoter (WO 00/70067); seed 
specific promoters, such as the phaseolin promoter from beans, may also be used. The stress- 
regulated nucleotide sequences of this invention can also be expressed under the regulation of 

1 5 promoters that are chemically regulated. This enables the nucleic acid sequence or encoded 
polypeptide to be synthesized only when the crop plants are treated with the inducing 
chemicals. Chemical induction of gene expression is detailed in EP 0 332 104 and U.S. 
Patent 5,614,395. 

In some instances it may be desirable to link a constitutive promoter to the stress 
20 regulated nucleotide sequences of the present invention. Examples of some constitutive 

promoters which have been described include the rice actin 1 (Wang et al., Molec. Cell BioL 
12:3399, 1992; U.S. Patent No. 5,641,876), CaMV 35S (Odell et al., Nature, 313:810, 1985), 
CaMV 19S (Lawton et al, Mol. Cell Biol.. 7:335, 1987), nos> Adh, sucrose synthase; and the 
ubiquitin promoters. 

25 In other situations it may be desirable to limit expression of stress-related sequences 

to specific tissues or stages of development. As used herein, the term "tissue specific or 
phase specific regulatory element" means a nucleotide sequence that effects transcription in 
only one or a few cell types, or only during one or a few stages of the life cycle of a plant, for 
example, only for a period of time during a particular stage of growth, development or 

30 differentiation. The terms "tissue specific" and "phase specific" are used together herein in 
referring to a regulatory element because a single regulatory element can have characteristics 
of both types of regulatory elements. For example, a regulatory element active only during a 
particular stage of plant development also can be expressed only in one or a few types of cells 
in the plant during the particular stage of development. As such, any attempt to classify such 
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regulatory elements as tissue specific or as phase specific can be difficult. Accordingly, 
unless indicated otherwise, all regulatory elements having the characteristic of a tissue 
specific regulatory element, or a phase specific regulatory element, or both are considered 
together for purposes of the present invention. 
5 Examples of tissue specific promoters which have been described include the lectin 

(Vodkin. Prog. Clin Biol. Res.. 138:87. 1983; Lindstrom et al.. Per. Genet.. 11:160, 1990) 
corn alcohol dehydrogenase 1 (Vogel et al., EMBO J.. 11:157, 1992; Dennis et al., Nuc. Acid 
Res.. 12:3983, 1984), corn light harvesting complex (Bansal et al., Proc. Natl. Acad. Sci. 
USA . 89:3654, 1992), com heat shock protein (Odell et al, Nature . 313:810, 1985), pea 

10 small subunit RuBP carboxylase (Poulsen et al., Mol. Gen. Genet. . 205:193, 1986), Ti 

plasmid mannopine synthase (Langridge et al., Proc. Natl. Acad. Sci., USA, 86:3219, 1989), 
Ti plasmid nopaline synthase (Langridge et al., Proc. Natl. Acad. Sci.. USA . 86:3219, 1989), 
petunia chalcone isomerase (vanTunen et al., EMBO J.. 7:1257, 1988), bean glycine rich 
protein 1 (Keller et al., Genes Dev.. 3:1639, 1989), truncated CaMV 35s (Odell et al., Nature . 

15 313:810, 1985), potato patatin (Wenzler et al., Plant Molec. Biol.. 13:347, 1989), root cell 
(Yamamoto et al., Nuc. Acid Res. . 18:7449, 1990), maize zein (Reina et al., Nuc. Acids Res., 
18:6425-26, 1990; Kriz et al., Mol. Gen. Genet. . 207:90, 1987; Wandelt et al., Nuc. Acids 
Res. . 17:2354, 1989; Langridge et al., CeU, 34:1015, 1983; Reina et al., Nuc. Acids Res.. 
18:6425-26 1990), globulin-1 (Belanger et al., Genetics . 129:863, 1991), a-tubulin, cab 

20 (Sullivan et al., Mol. Gen. Genet. . 215:431, 1989), PEPCase (Hudspeth & Grula, Plant 
Molec. Biol.. 12:579, 1989), R gene complex-associated promoters (Chandler et al., Plant 
Cell 1:1175, 1989), histone, and chalcone synthase promoters (Franken et al., EMBO J.. 
10:2605, 1991). Tissue specific enhancers are described in Fromm et al. Bio/Technology, 
8:833, 1989. 

25 Several other tissue-specific regulated genes and/or promoters have been reported in 

plants. These include genes encoding the seed storage proteins (such as napin, cruciferin, 
beta-conglycinin, and phaseolin) zein or oil body proteins (such as oleosin), or genes 
involved in fatty acid biosynthesis (including acyl carrier protein, stearoyl-ACP desaturase, 
fatty acid desaturases (fad 2-1)), and other genes expressed during embryo development 

30 (such as Bce4, see, for example, EP 255378 and Kridl et al., Seed Sci. Res. . 1 :209, 1991). 
Particularly useful for seed-specific expression is the pea vicilin promoter (Czako et al., Mol. 
Gen. Genet. . 235:33, 1992). (See also U.S. Pat. No. 5,625,136, herein incorporated by 
reference.) Other useful promoters for expression in mature leaves are those that are 
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switched on at the onset of senescence, such as the SAG promoter from Arabidopsis (Gan et 
al., Science, 270:1986, 1995). 

A class of fruit-specific promoters expressed at or during antithesis through fruit 
development, at least until the beginning of ripening, is discussed in U.S. 4,943,674. cDNA 
5 clones that are preferentially expressed in cotton fiber have been isolated (John et al., Proc. 
Natl. Acad. Sci. USA . 89:5769, 1992). cDNA clones from tomato displaying differential 
expression during fruit development have been isolated and characterized (Mansson et al, 
Gen. Genet. . 200:356, 1985, Slater et al., Plant Molec. Biol.. 5:137, 1985). The promoter for 
polygalacturonase gene is active in fruit ripening. The polygalacturonase gene is described in 

10 U.S. Patent No. 4,535,060, U.S. Patent No. 4,769,061, U.S. Patent No. 4,801,590, and U.S. 
Patent No. 5,107,065, which disclosures are incorporated herein by reference. 

Other examples of tissue-specific promoters include those that direct expression in 
leaf cells following damage to the leaf (for example, from chewing insects), in tubers (for 
example, patatin gene promoter), and in fiber cells (an example of a developmentally- 

1 5 regulated fiber cell protein is E6 (John et al., Proc. Natl. Acad. Sci. USA. 89:5769, 1 992). 
The E6 gene is most active in fiber, although low levels of transcripts are found in leaf, ovule 
and flower. 

Additional tissue specific or phase specific regulatory elements include, for example, 
the AGL8/FRUITFULL regulatory element, which is activated upon floral induction (Hempel 

20 et al., Development 124:3845-3853, 1997) root specific regulatory elements such as the 
regulatory elements from the RCP1 gene and the LRP1 gene (Tsugeki and Fedoroff, Proc. 
Natl. Acad.. USA 96:12941-12946. 1999; Smith and FedorofY. Plant Cell 7:735, 1995); 
flower specific regulatory elements such as the regulatory elements from the LEAFY gene 
and the APETELA 1 gene (Blazquez et al., Development 124:3835-3844, 1997; Hempel et al., 

25 supra, 1997); seed specific regulatory elements such as the regulatory element from the 

oleosin gene (Plant et al., Plant Mol. Biol. 25:193-205, 1994), and dehiscence zone specific 
regulatory element. Additional tissue specific or phase specific regulatory elements include 
the Znl3 promoter, which is a pollen specific promoter (Hamilton et al., Plant Mol. Biol. 
18:21 1-218, 1992); the UNUSUAL FLORAL ORGANS (UFO) promoter, which is active in 

30 apical shoot meristem; the promoter active in shoot meristems (Atanassova et al., Plant J. 
2:291, 1992), the cdc2a promoter and cyc07 promoter (see, for example, Ito et al., Plant Mol. 
Biol. 24:863, 1994; Martinez et al., Proc. Natl. Acad. Sci.. USA 89:7360, 1992; Medford et 
al., Plant Cell 3:359, 1991; Terada et al., Plant J. 3:241, 1993; Wissenbach et al., Plant J. 
4:411, 1993); the promoter of the APETELA3 gene, which is active in floral meristems (Jack 
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et al. f Ceil 76:703, 1994; Hempel et al., supra, 1997); a promoter of an agamous-like (AGL) 
family member, for example, AGL8, which is active in shoot meristem upon the transition to 
flowering (Hempel et al., supra, 1997); floral abscission zone promoters; LI -specific 
promoters; and the like. 

5 The tissue-specificity of some "tissue-specific" promoters may not be absolute and 

may be tested by one skilled in the art using the diphtheria toxin sequence. One can also 
achieve tissue-specific expression with "leaky" expression by a combination of different 
tissue-specific promoters (Beals et al., Plant Cell . 9:1527, 1997). Other tissue-specific 
promoters can be isolated by one skilled in the art (see U.S. 5,589,379). Several inducible 

10 promoters ("gene switches'*) have been reported, many of which are described in the review 
by Gatz (Cur. Opin. Biotech , 7:168, 1996) and Gatz (Ann. Rev. Plant. Physiol. Plant Mol. 
Biol., 48:89, 1997). These include tetracycline repressor system, Lac repressor system, 
copper-inducible systems, salicylate-inducible systems (such as the PRla system), 
glucocorticoid- (Aoyama et al., N-H Plant J.. 1 1 :605, 1997) and ecdysome-inducible systems. 

1 5 Also included are the benzene sulphonamide- (U.S. Patent No. 5,364,780) and alcohol-(WO 
97/06269 and WO 97/06268) inducible systems and glutathione S-transferase promoters. 

In some instances it might be desirable to inhibit expression of a native DNA 
sequence within a plant's tissues to achieve a desired phenotype. In this case, such inhibition 
might be accomplished with transformation of the plant to comprise a constitutive, tissue- 

20 independent promoter operably linked to an antisense nucleotide sequence, such that 

constitutive expression of the antisense sequence produces an RNA transcript that interferes 
with translation of the mRNA of the native DNA sequence. 

Inducible regulatory elements also are useful for purposes of the present invention. 
As used herein, the term "inducible regulatory element" means a regulatory element that, 

25 when exposed to an inducing agent, effects an increased level of transcription of a nucleotide 
sequence to which it is operatively linked as compared to the level of transcription, if any, in 
the absence of an inducing agent. Inducible regulatory elements can be those that have no 
basal or constitutive activity and only effect transcription upon exposure to an inducing 
agent, or those that effect a basal or constitutive level of transcription, which is increased 

30 upon exposure to an inducing agent. Inducible regulatory elements that effect a basal or 
constitutive level of expression generally are useful in a method or composition of the 
invention where the induced level of transcription is substantially greater than the basal or 
constitutive level of expression, for example, at least about two-fold greater, or at least about 
five-fold greater. Particularly useful inducible regulatory elements do not have a basal or 
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constitutive activity, or increase the level of transcription at least about ten-fold greater than a 
basal or constitutive level of transcription associated with the regulatory element. 

Inducible promoters that have been described include the ABA- and turgor-inducible 
promoters, the promoter of the auxin-binding protein gene (Schwob et al., Plant J. . 4:423, 
5 1993), the UDP glucose flavonoid glycosyl-transferase gene promoter (Ralston et al., 
Genetics , 1 19: 1 85, 1 988), the MPI proteinase inhibitor promoter (Cordero et al., Plant J.. 
6:141, 1994), and the glyceraldehyde-3-phosphate dehydrogenase gene promoter (Kohler et 
al., Plant Molec. BioL 29:1293, 1995; Quigley et al., J. Mol. EvoU 29:412, 1989; Martinez 
et al., J. Mol Biol.. 208:551, 1989). 

1 0 The term "inducing agent" is used to refer to a chemical, biological or physical agent 

or environmental condition that effects transcription from an inducible regulatory element. In 
response to exposure to an inducing agent, transcription from the inducible regulatory 
element generally is initiated de novo or is increased above a basal or constitutive level of 
expression. Such induction can be identified using the methods disclosed herein, including 

1 5 detecting an increased level of RNA transcribed from a nucleotide sequence operatively 
linked to the regulatory element, increased expression of a polypeptide encoded by the 
nucleotide sequence, or a phenotype conferred by expression of the encoded polypeptide. 

An inducing agent useful in a method of the invention is selected based on the 
particular inducible regulatory element. For example, the inducible regulatory element can 

20 be a metallothionein regulatory element, a copper inducible regulatory element or a 

tetracycline inducible regulatory element, the transcription from which can be effected in 
response to metal ions, copper or tetracycline, respectively (Furst et al., Cell 55:705-717, 
1988; Mett et al., Proc. Natl. Acad. ScL USA 90:4567-4571, 1993; Gatz et al, Plant J. 2:397- 
404, 1992; Roder et al, Mol. Gen. Genet. 243:32-38, 1994). The inducible regulatory 

25 element also can be an ecdysone regulatory element or a glucocorticoid regulatory element, 
the transcription from which can be effected in response to ecdysone or other steroid 
(Christopherson et al., Proc. Natl. Acad. ScL USA 89:6314-6318, 1992; Schena et al, Proc. 
Natl. Acad. Sci.« USA 88:10421-10425, 1991). In addition, the regulatory element can be a 
cold responsive regulatory element or a heat shock regulatory element, the transcription of 

30 which can be effected in response to exposure to cold or heat, respectively (Takahashi et al., 
Plant Phvsiol. 99:383-390, 1992). Additional regulatory elements useful in the methods or 
compositions of the invention include, for example, the spinach nitrite reductase gene 
regulatory element (Back et al., Plant Mol. Biol. 17:9, 1991); a light inducible regulatory 
element (Feinbaum et al., Mol. Gen. Genet. 226:449, 1991; Lam and Chua, Science 248:471, 
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1990), a plant hormone inducible regulatory element (Yamaguchi-Shinozaki et al., Plant Mol. 
Biol 15:905, 1990; Kares et al., Plant Mol. Biol. 15:225, 1990), and the like. 

An inducible regulatory element also can be a plant stress-regulated regulatory 
element of the invention. In addition to the known stress conditions that specifically induce 
5 or repress expression from such elements, the present invention provides methods of 
identifying agents that mimic a stress condition. Accordingly, such stress mimics are 
considered inducing or repressing agents with respect to a plant stress-regulated regulatory 
element. In addition, a recombinant polypeptide comprising a zinc finger domain, which is 
specific for the regulatory element, and an effector domain, particularly an activator, can be 

10 useful as an inducing agent for a plant stress-regulated regulatory element. Furthermore, 
such a recombinant polypeptide provides the advantage that the effector domain can be a 
repressor domain, thereby providing a repressing agent, which decreases expression from the 
regulatory element. In addition, use of such a method of modulating expression of an 
endogenous plant stress-regulated nucleotide sequence provides the advantage that the 

1 5 polynucleotide encoding the recombinant polypeptide can be introduced into cells of the 
plant, thus providing a transgenic plant that can be regulated coordinately with the 
endogenous plant stress-regulated nucleotide sequence upon exposure to a stress condition. 
A polynucleotide encoding such a recombinant polypeptide can be operatively linked to and 
expressed from a constitutively active, inducible or tissue specific or phase specific 

20 regulatory element. 

In one embodiment, the promoter may be a gamma zein promoter, an oleosin olel6 
promoter, a globulin! promoter, an actin I promoter, an actin cl promoter, a sucrose 
synthetase promoter, an INOPS promoter, an EXM5 promoter, a globulin2 promoter, a b-32, 
ADPG- pyrophosphorylase promoter, an Ltpl promoter, an Ltp2 promoter, an oleosin olel7 

25 promoter, an oleosin ole 1 8 promoter, an actin 2 promoter, a pollen-specific protein promoter, 
a pollen-specific pectate lyase promoter, an anther-specific protein promoter, an anther- 
specific gene RTS2 promoter, a pollen- specific gene promoter, a tapeturn-specific gene 
promoter, tapetum- specific gene RAB24 promoter, a anthranilate synthase alpha subunit 
promoter, an alpha zein promoter, an anthranilate synthase beta subunit promoter, a 

30 dihydrodipicolinate synthase promoter, a Thil promoter, an alcohol dehydrogenase promoter, 
a cab binding protein promoter, an H3C4 promoter, a RUBISCO SS starch branching enzyme 
promoter, an ACCase promoter, an actin3 promoter, an actin7 promoter, a regulatory protein 
GF14-12 promoter, a ribosomal protein L9 promoter, a cellulose biosynthetic enzyme 
promoter, an S-adenosyl-L-homocysteine hydrolase promoter, a superoxide dismutase 
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promoter, a C-kinase receptor promoter, a phosphoglycerate mutase promoter, a root-specific 
RCc3 mRNA promoter, a glucose-6 phosphate isomerase promoter, a pyrophosphate-fructose 
6-phosphatelphosphotransferase promoter, an ubiquitin promoter, a bela-ketoacyl-ACP 
synthase promoter, a 33 kDa photosystem 1 1 promoter, an oxygen evolving protein 
5 promoter, a 69 kDa vacuolar ATPase subunit promoter, a metallothionein-like protein 
promoter, a glyceraldehyde-3 -phosphate dehydrogenase promoter, an ABA- and ripening- 
inducible-like protein promoter, a phenylalanine ammonia lyase promoter, an adenosine 
triphosphatase S-adenosyl-Uiomocysteine hydrolase promoter, an a- tubulin promoter, a cab 
promoter, a PEPCase promoter, an R gene promoter, a lectin promoter, a light harvesting 

10 complex promoter, a heat shock protein promoter, a chalcone synthase promoter, a zein 
promoter, a globulin-1 promoter, an ABA promoter, an auxin-binding protein promoter, a 
UDP glucose flavonoid glycosyl-transferase gene promoter, an NTI promoter, an actin 
promoter, an opaque 2 promoter, a b70 promoter, an oleosin promoter, a CaMV 35S 
promoter, a CaMV 1 9S promoter, a histone promoter, a turgor-inducible promoter, a pea 

1 5 small subunit RuBP carboxylase promoter, a Ti plasmid mannopine synthase promoter, Ti 
plasmid nopaline synthase promoter, a petunia chalcone isomerase promoter, a bean glycine 
rich protein I promoter, a CaMV 35S transcript promoter, a potato patatin promoter, or a S- 
E9 small subunit RuBP carboxylase promoter. 

In addition to promoters, a variety of 5' and 3' transcriptional regulatory sequences are 

20 also available for use in the present invention. Transcriptional terminators are responsible for 
the termination of transcription and correct mRNA polyadenylation. The 3 f nontranslated 
regulatory DNA sequence usually includes from about 50 to about 1 ,000, typically about 100 
to about 1,000, nucleotide base pairs and contains plant transcriptional and translational 
termination sequences. Appropriate transcriptional terminators and those which are known to 

25 function in plants include the CaMV 35S terminator, the tml terminator, the nopaline 

synthase terminator, the pea rbcS E9 terminator, the terminator for the T7 transcript from the 
octopine synthase gene of Agrobacterium tumefaciens, and the 3' end of the protease inhibitor 
I or II genes from potato or tomato, although other 3' elements known to those of skill in the 
art can also be employed. Alternatively, one also could use a gamma coixin, oleosin 3 or 

30 other terminator from the genus Coix. 

Suitable 3' elements include those from the nopaline synthase gene of Agrobacterium 
tumefaciens (Bevan et al., Nature, 304:184, 1983), the terminator for the T7 transcript from 
the octopine synthase gene of Agrobacterium tumefaciens, and the 3* end of the protease 
inhibitor I or II genes from potato or tomato. 
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As the DNA sequence between the transcription initiation site and the start of the 
coding sequence, i.e., the untranslated leader sequence, can influence gene expression, one 
may also wish to employ a particular leader sequence. Suitable leader sequences are 
contemplated to include those that comprise sequences predicted to direct optimum 
5 expression of the attached sequence, i.e., to include a consensus leader sequence that may 
increase or maintain mRNA stability and prevent inappropriate initiation of translation. The 
choice of such sequences will be known to those of skill in the art in light of the present 
disclosure. Sequences that are derived from genes that are highly expressed in plants are 
desirable. 

10 Other sequences that have been found to enhance gene expression in transgenic plants 

include intron sequences (e.g., from Adhl, bronzel, actinl. actin 2 (WO 00/760067), or the 
sucrose synthase intron) and viral leader sequences (e.g., from TMV, MCMV and AMV). 
For example, a number of non-translated leader sequences derived from viruses are known to 
enhance expression. Specifically, leader sequences from Tobacco Mosaic Virus (TMV), 

1 5 Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown 
to be effective in enhancing expression (e.g., Gallie et al., Nuc. Acids Res., 15:8693, 1987; 
Skuzeski et al., Plant Mol.Biol. . 15:65, 1990). Other leaders known in the art, include but 
are not limited to: Picornavirus leaders, for example, EMCV leader (Encephalomyocarditis 5 
noncoding region) (Elroy-Stein et al., Proc. Natl. Acad. Sci. USA, 86:6126, 1989); Potyvirus 

20 leaders, for example, TEV leader (Tobacco Etch Virus); MDMV leader (Maize Dwarf 

Mosaic Virus); Human immunoglobulin heavy-chain binding protein (BiP) leader, (Macejak 
et al., Nature, 353:90, 1991); Untranslated leader from the coat protein mRNA of alfalfa 
mosaic virus (AMV RNA 4), (Jobling et al., Nature. 325:622, 1987; Tobacco mosaic virus 
leader (TMV), (Gallie et al., Molecular Biology of RNA, 237 1989; and Maize Chlorotic 

25 Mottle Virus leader (MCMV) (Lommel et al., Virology, 81:382, 1991 . See also, Delia- 
Cioppa et al., Plant Physiol. . 84:965, 1987. 

Regulatory elements such as Adh intron 1 (Callis et al., Genes DeveL 1:1183,1 987), 
sucrose synthase intron (Vasil et al., Plant Physiol.. 91 : 1575, 1989) or TMV omega element 
(Gallie, et al., Molecular Biology of RNA. 237 1989 1989), may further be included where 

30 desired. 

Examples of enhancers include elements from the CaMV 35S promoter, octopine 
synthase genes (Ellis el al., EMBO J. . 6:3203, 1987), the rice actin I gene, the maize alcohol 
dehydrogenase gene (Callis et al, Genes Devel.. 1 :1 183, 1987), the maize shrunken I gene 
(Vasil et al., Plant Physiol. . 91:1575, 1989), TMV Omega element (Gallie et al., Molecular 
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Biology of RNA . 1989) and promoters from non-plant eukaryotes (e.g. yeast; Ma et al., 
Nature, 334:631, 1988). 

Vectors for use in accordance with the present invention may be constructed to 
include the ocs enhancer element. This element was first identified as a 16 bp palindromic 
5 enhancer from the octopine synthase (ocs) gene of ultilane (Ellis et al., EMBOJ., 6:3203, 
1987), and is present in at least 10 other promoters (Bouchez et aL, EMBO J.. 8:4197, 1989). 
The use of an enhancer element, such as the ocs element and particularly multiple copies of 
the element, will act to increase the level of transcription from adjacent promoters when 
applied in the context of monocot transformation. 

10 The methods of the invention provide genetically modified plant cells, which can 

contain, for example, a coding region, or functional portion thereof, of a plant stress- 
regulated polynucleotide operatively linked to a heterologous inducible regulatory element; 
or a plant stress-regulated regulatory element operatively linked to a heterologous nucleotide 
sequence encoding a polypeptide of interest. In such a plant, the expression from the 

15 inducible regulatory element can be effected by exposing the plant cells to an inducing agent 
in any of numerous ways depending, for example, on the inducible regulatory element and 
the inducing agent. For example, where the inducible regulatory element is a cold responsive 
regulatory element present in the cells of a transgenic plant, the plant can be exposed to cold 
conditions, which can be produced artificially, for example, by placing the plant in a 

20 thermostatically controlled room, or naturally, for example, by planting the plant in an 
environment characterized, at least in part, by attaining temperatures sufficient to induce 
transcription from the promoter but not so cold as to kill the plants. By examining the 
phenotype of such transgenic plants, those plants that ectopically express a gene product that 
confers increased resistance of the plant to cold can be identified. Similarly, a transgenic 

25 plant containing a metallothionein promoter can be exposed to metal ions such as cadmium 
or copper by watering the plants with a solution containing the inducing metal ions, or can be 
planted in soil that is contaminated with a level of such metal ions that is toxic to most plants. 
The phenotype of surviving plants can be observed, those expressing desirable traits can be 
selected. 

30 As used herein, the term "phenotype" refers to a physically detectable characteristic. 

A phenotype can be identified visually by inspecting the physical appearance of a plant 
following exposure; for example, to increased osmotic conditions; can be identified using an 
assay to detecting a product produced due to expression of reporter gene, for example, an 
RNA molecule, a polypeptide such as an enzyme, or other detectable signal such as disclosed 
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herein; or by using any appropriate tool useful for identifying a phenotype of a plant, for 
example, a microscope, a fluorescence activated cell sorter, or the like. 

A transgenic plant containing an inducible regulatory element such as a steroid 
inducible regulatory element can be exposed to a steroid by watering the plants with a 
5 solution containing the steroid- The use of an inducible regulatory element that is induced 
upon exposure to a chemical or biological inducing agent that can be placed in solution or 
suspension in an aqueous medium can be particularly useful because the inducing agent can 
be applied conveniently to a relatively large crop of transgenic plants containing the inducible 
regulatory element, for example, through a watering system or by spraying the inducing 

10 agent over the field. As such, inducible regulatory elements that are responsive to an 

environmental inducing agent, for example, cold; heat; metal ions or other potentially toxic 
agents such as a pesticides, which can contaminate a soil; or the like; or inducible regulatory 
elements that are regulated by inducing agents that conveniently can be applied to plants, can 
be particularly useful in a method or composition of the invention, and allow the 

15 identification and selection of plants that express desirable traits and survive and grow in 
environments that otherwise would not support growth of the plants. 

For purposes of modulating the responsiveness of a plant to a stress condition, it can 
be useful to introduce a modified plant stress-regulated regulatory element into a plant. Such 
a modified regulatory element can have any desirable characteristic, for example, it can be 

20 inducible to a greater level than the corresponding wild-type promoter, or it can be 

inactivated such that, upon exposure to a stress, there is little or no induction of expression of 
a nucleotide sequence operatively linked to the mutant element. A plant stress-regulated 
regulatory element can be modified by incorporating random mutations using, for example, 
in vitro recombination or DNA shuffling (Stemmer et al., Nature 370: 389-391, 1994; U.S. 

25 Patent No. 5,605,793). Using such a method, millions of mutant copies of the 

polynucleotide, for example, stress-regulated regulatory element, can be produced based on 
the original nucleotide sequence, and variants with improved properties, such as increased 
inducibility can be recovered. 

A mutation method such as DNA shuffling encompasses forming a mutagenized 

30 double-stranded polynucleotide from a template double-stranded polynucleotide, wherein the 
template double-stranded polynucleotide has been cleaved into double stranded random 
fragments of a desired size, and comprises the steps of adding to the resultant population "of " 
double-stranded random fragments one or more single or double stranded oligonucleotides, 
wherein the oligonucleotides comprise an area of identity and an area of heterology to the 
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double stranded template polynucleotide; denaturing the resultant mixture of double stranded 
random fragments and oligonucleotides into single stranded fragments; incubating the 
resultant population of single stranded fragments with a polymerase under conditions that 
result in the annealing of the single stranded fragments at the areas of identity to form pairs 
5 of annealed fragments, the areas of identity being sufficient for one member of a pair to 
prime replication of the other, thereby foiming a mutagenized double-stranded 
polynucleotide; and repeating the second and third steps for at least two further cycles, 
wherein the resultant mixture in the second step of a further cycle includes the mutagenized 
double-stranded polynucleotide from the third step of the previous cycle, and the further 

1 0 cycle forms a further mutagenized double-stranded polynucleotide. Typically, the 

concentration of a single species of double stranded random fragment in the population of 
double stranded random fragments is less than 1% by weight of the total DNA. In addition, 
the template double stranded polynucleotide can comprise at least about 100 species of 
polynucleotides. The size of the double stranded random fragments can be from about 5 base 

15 pairs to 5 kilobase pairs. In a further embodiment, the fourth step of the method comprises 
repeating the second and the third steps for at least 10 cycles. 

A plant stress-regulated regulatory element of the invention is useful for expressing a 
nucleotide sequence operatively linked to the element in a cell, particularly a plant cell. As 
used herein, the term "expression" refers to the transcription and/or translation of an 

20 endogenous gene or a transgene in plants. In the case of an antisense molecule, for example, 
the term "expression" refers to the transcription of the polynucleotide encoding the antisense 
molecule. 

As used herein, the term "operatively linked," when used in reference to a plant 
stress-regulated regulatory element, means that the regulatory element is positioned with 

25 respect to a second nucleotide sequence such that the regulatory element effects transcription 
or transcription and translation of the nucleotide sequence in substantially the same manner, 
but not necessarily to the same extent, as it does when the regulatory element is present in its 
natural position in a genome. Transcriptional promoters, for example, generally act in a 
position and orientation dependent manner and usually are positioned at or within about five 

30 nucleotides to about fifty nucleotides 5' (upstream) of the start site of transcription of a gene 
in nature. In comparison, enhancers and silencers can act in a relatively position or 
orientation independent manner and, therefore, can be positioned several hundred or thousand 
nucleotides upstream or downstream from a transcription start site, or in an intron within the 
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coding region of a gene, yet still be operatively linked to a coding region so as to effect 
transcription. 

The second nucleotide sequence, i.e., the sequence operatively linked to the plant 
stress-regulated regulatory element, can be any nucleotide sequence, including, for example, 
5 a coding region of a gene or cDNA; a sequence encoding an antisense molecule, an RNAi 
molecule, ribozyme, triplexing agent (see, for example, Frank-Kamenetskii and Mirkin, Ann. 
Rev. Biochem. 64:65-95, 1 995), or the like; or a sequence that, when transcribed, can be 
detected in the cell using, for example, by hybridization or amplification, or when translated 
produces a detectable signal. The term "coding region" is used broadly herein to include a 

10 nucleotide sequence of a genomic DNA or a cDNA molecule comprising all or part of a 
coding region of the coding strand. A coding region can be transcribed from an operatively 
linked regulatory element, and can be translated into a full-length polypeptide or a peptide 
portion of a polypeptide, preferably a peptide portion having the same functional 
characteristics as the full-length polypeptide. Tt should be recognized that, in a nucleotide 

15 sequence comprising a coding region, not all of the nucleotides in the sequence need 

necessarily encode the polypeptide and, particularly, that a gene transcript can contain one or 
more introns, which do not encode an amino acid sequence of a polypeptide but, 
nevertheless, are part of the coding region, particularly the coding strand, of the gene. 

The present invention also relates to a recombinant polynucleotide, which contains a 

20 functional portion of a plant stress-regulated nucleotide sequence operatively linked to a 
heterologous nucleotide sequence. As used herein, the term "functional portion 1 * of plant 
stress-regulated sequence means a contiguous nucleotide sequence of the plant stress- 
regulated sequence that provides a function within a plant or plant cell. The portion can be 
any portion of the sequence, particularly a coding sequence, or a sequence encoding a peptide 

25 portion of the stress-regulated polypeptide; the stress-regulated regulatory element such as a 
promoter or minimal promoter; a sequence useful as an antisense molecule or triplexing 
agent; or a sequence useful for disrupting (knocking-out) an endogenous plant stress- 
regulated nucleotide sequence. 

A heterologous nucleotide sequence is a nucleotide sequence that is not normally part 

30 of the plant stress-regulated polynucleotide from which the functional portion of the plant 

stress-regulated polynucleotide-component of the recombinant polynucleotide is obtained; or, 
if it is a part of the plant stress-regulated polynucleotide sequence from which the functional 
portion is obtained, it is an orientation other than it would normally be in, for example, is an 
. antisense sequence, or comprises at least partially discontinuous as compared to the genomic 
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structure, for example, a single exon operatively linked to the regulatory element. In general, 
where the functional portion of the plant stress-regulated nucleotide sequence comprises the 
coding sequence in a recombinant polynucleotide of the invention, the heterologous 
nucleotide sequence will function as a regulatory element. The regulatory element can be 
5 any heterologous regulatory element, including, for example, a constitutively active 

regulatory element, an inducible regulatory element, or a tissue specific or phase specific 
regulatory element, as disclosed above. Conversely, where the functional portion of the plant 
stress-regulated polynucleotide comprises the stress-regulated regulatory element of a 
recombinant polynucleotide of the invention, the heterologous nucleotide sequence generally 

1 0 will be a nucleotide sequence that can be transcribed and, if desired, translated. Where the 
heterologous nucleotide sequence is expressed from a plant stress-regulated regulatory 
element, it generally confers a desirable phenotype to a plant cell containing the recombinant 
polynucleotide, or provides a means to identify a plant cell containing the recombinant 
polynucleotide. It should be recognized that a "desirable" phenotype can be one that 

1 5 decreases the ability of a plant cell to compete where the plant cell, or a plant containing the 
cell, is an undesired plant cell. Thus, a heterologous nucleotide sequence can allow a plant to 
grow, for example, under conditions in which it would not normally be able to grow. 

A heterologous nucleotide sequence can be, or encode, a selectable marker. As used 
herein, the term "selectable marker" is used herein to refer to a molecule that, when present 

20 or expressed in a plant cell, provides a means to identify a plant cell containing the marker. 
As such, a selectable marker can provide a means for screening a population of plants, or 
plant cells, to identify those having the marker. A selectable marker also can confer a 
selective advantage to the plant cell, or a plant containing the cell. The selective advantage 
can be, for example, the ability to grow in the presence of a negative selective agent such as 

25 an antibiotic or herbicide, compared to the growth of plant cells that do not contain the 

selectable marker. The selective advantage also can be due, for example, to an enhanced or 
novel capacity to utilize an added compound as a nutrient, growth factor or energy source. A 
selectable advantage can be conferred, for example, by a single polynucleotide, or its 
expression product, or to a combination of polynucleotides whose expression in a plant cell 

30 gives the cell with a positive selective advantage, a negative selective advantage, or both. 

Examples of selectable markers include those that confer antimetabolite resistance, 
for example, dihydrofolate reductase, which confers resistance to methotrexate (Reiss, Plant 
Physiol. (Life Sci. Adv.) 13:143-149, 1994); neomycin phosphotransferase, which confers 
resistance to the aminoglycosides neomycin, kanamycin and paromycin (Herrera-Estrella, 
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EMBO J. 2:987-995, 1983) and hygro, which confers resistance to hygromycin (Marsh, Gene 
32:481-485, 1984), trpB, which allows cells to utilize indole in place of tryptophan; hisD, 
which allows cells to utilize histinol in place of histidine (Hartman, Proc. Natl. Acad. ScL 
USA 85:8047, 1988); mannose-6-phosphate isomerase which allows cells to utilize mannose 
5 (WO 94/20627); ornithine decarboxylase, which confers resistance to the ornithine 

decarboxylase inhibitor, 2-(difluoromethyl)-DL-ornithine (DFMO; McConlogue, 1987, In: 
Current Communications in Molecular Biology, Cold Spring Harbor Laboratory ed.); and 
deaminase from Aspergillus terreus, which confers resistance to Blasticidin S (Tamura, 
Biosci. Biotechnol. Biochem. 59:2336-2338. 1995). Additional selectable markers include 

1 0 those that confer herbicide resistance, for example, phosphinothricin acetyltransferase gene, 
which confers resistance to phosphinothricin (White et ah, Nucl. Acids Res. 18:1062, 1990; 
Spencer et al, Theor. AddL Genet. 79:625-631, 1990), a mutant EPSPV-synthase, which 
confers glyphosate resistance (Hinchee et al., Bio/Technology 91 :91 5-922, 1998), a mutant 
acetolactate synthase, which confers imidazolione or sulfonylurea resistance (Lee et al., 

1 5 EMBO J. 7: 1241-1248, 1988), a mutant psbA, which confers resistance to atrazine (Smeda et 
al., Plant Phvsiol. 103:91 1-917, 1993), or a mutant protoporphyrinogen oxidase (see U.S. 
Patent No. 5, 767, 373), or other markers conferring resistance to an herbicide such as 
glufosinate. In addition, markers that facilitate identification of a plant cell containing the 
polynucleotide encoding the marker include, for example, luciferase (Giacomin, Plant Sci. 

20 1 16:59-72, 1996; Scikantha, J. BacterioL 178:121, 1996), green fluorescent protein (Gerdes, 
FEBS Lett. 389:44-47, 1996) or fl-glucuronidase (Jefferson, EMBO J. 6:3901-3907, 1997), 
and numerous others as disclosed herein or otherwise known in the art. Such markers also 
can be used as reporter molecules. 

A heterologous nucleotide sequence can encode an antisense molecule, particularly an 

25 antisense molecule specific for a plant stress-regulated nucleotide sequence, for example, the 
gene from which the regulatory component of the recombinant polynucleotide is derived. 
Such a recombinant polynucleotide can be useful for reducing the expression of a plant 
stress-regulated polypeptide in response to a stress condition because the antisense molecule, 
like the polypeptide, only will be induced upon exposure to the stress. A heterologous 

30 nucleotide sequence also can be, or can encode, a ribozyme or a triplexing agent. In addition 
to being useful as heterologous nucleotide sequences, such molecules also can be used 
directly in a method of the invention, for example, to modulate the responsiveness of a plant 
cell to a stress condition. Thus, an antisense molecule, ribozyme, or triplexing agent can be 
contacted directly with a target cell and, upon uptake by the cell, can effect their antisense, 
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ribozyme or triplexing activity; or can be encoded by a heterologous nucleotide sequence that 
is expressed in a plant cell from a plant stress-regulated regulatory element, whereupon it can 
effect its activity. 

An antisense polynucleotide, ribozyme or triplexing agent is complementary to a 
5 target sequence, which can be a DNA or RNA sequence, for example, messenger RNA, and 
can be a coding sequence, a nucleotide sequence comprising an intron-exon junction, a 
regulatory sequence, or the like. The degree of complementarity is such that (he 
polynucleotide, for example, an antisense polynucleotide, can interact specifically with the 
target sequence in a cell. Depending on the total length of the antisense or other 

10 polynucleotide, one or a few mismatches with respect to the target sequence can be tolerated 
without losing the specificity of the polynucleotide for its target sequence. Thus, few if any 
mismatches would be tolerated in an antisense molecule consisting, for example, of twenty 
nucleotides, whereas several mismatches will not affect the hybridization efficiency of an 
antisense molecule that is complementary, for example, to the full length of a target mRNA 

1 5 encoding a cellular polypeptide. The number of mismatches that can be tolerated can be 
estimated, for example, using well known formulas for determining hybridization kinetics 
(see Sambrook et al., "Molecular Cloning; A Laboratory Manual" 2nd Edition (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY; 1989) or can be determined empirically 
using methods as disclosed herein or otherwise known in the art, particularly by determining 

20 that the presence of the antisense polynucleotide, ribozyme, or triplexing agent in a cell 

decreases the level of the target sequence or the expression of a polypeptide encoded by the 
target sequence in the cell. 

A nucleotide sequence useful as an antisense molecule, a ribozyme or a triplexing agent 
can inhibit translation or cleave a polynucleotide encoded by plant stress-regulated nucleotide 

25 sequence, thereby modulating the responsiveness of a plant cell to a stress condition. An 
antisense molecule, for example, can bind to an mRNA to form a double stranded molecule 
that cannot be translated in a cell. Antisense oligonucleotides of at least about 15 to 25 
nucleotides are typically used since they are easily synthesized and can hybridize specifically 
with a target sequence, although longer antisense molecules can be expressed from a 

30 recombinant polynucleotide introduced into the target cell. Specific nucleotide sequences 
useful as antisense molecules can be identified using well-known methods, for example, gene 
walking methods (see, for example, Seimiya et al, J. Biol. Chem. 272:4631-4636, 1997). 
Where the antisense molecule is contacted directly with a target cell, it can be operatively 
associated with a chemically reactive group such as iron-linked EDTA, which cleaves a target 
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RNA at the site of hybridization. A triplexing agent, in comparison, can stall transcription 
(Maher et al., Antisense Res. Devel. 1:227, 1991; Helene, Anticancer Drug Design 6:569, 
1991). 

A plant stress-regulated regulatory element can be included in an expression cassette. 
5 As used herein, the term "expression cassette" refers to a nucleotide sequence that can direct 
expression of a particular polynucleotide in an appropriate host cells. Expression cassettes 
typically comprise as operably linked components, a promoter, a nucleotide sequence whose 
expression is desired and a termination signal. Expression cassettes also often contain 
sequences necessary for proper translation of the sequence to be expressed along with 

10 selection and marker sequences. Thus, a plant stress-regulated regulatory element can 
constitute an expression cassette, or component thereof. An expression cassette is 
particularly useful for directing expression of a nucleotide sequence, which can be an 
endogenous nucleotide sequence or a heterologous nucleotide sequence, in a cell, particularly 
a plant cell. In general, an expression cassette can be introduced into a plant cell such that 

1 5 the plant cell, a plant resulting from the plant cell, seeds obtained from such a plant, or plants 
produced from such seeds are resistant to a stress condition. 

Additional regulatory sequences as disclosed herein or other desirable sequences such 
as selectable markers or the like can be incorporated into an expression cassette containing a 
plant stress-regulated regulatory element (see, for example, WO 99/47552). Examples of 

20 suitable markers include dihydrofolate reductase (DHFR) or neomycin resistance for 

eukaryotic cells and tetracycline or ampicillin resistance for E. coli. Selection markers in 
plants include bleomycin, gentamycin, glyphosate, hygromycin, kanamycin, methotrexate, 
phleomycin, phosphinotricin, spectinomycin, dtreptomycin, sulfonamide and sulfonylureas 
resistance. Maliga et al, Methods in Plant Molecular Biology. Cold Spring Harbor 

25 Laboratory Press, 1995, p. 39. The selection marker can have its own promoter or its 
expression can be driven by the promoter operably linked to the sequence of interest. 
Additional sequences such as intron sequences (e.g. from Adhl or bronzel) or viral leader 
sequences (e.g. from TMV, MCMV and AIVIV), all of which can enhance expression, can 
be included in the cassette. In addition, where it is desirable to target expression of a 

30 nucleotide sequence operatively linked to the stress-regulated regulatory element, a sequence 
encoding a cellular localization motif can be included in the cassette, for example, such that 
an encoded transcript or translation product is translocated to and localizes in the cytosol, 
nucleus, a chloroplast, or another subcellular organelle. Examples of usefiil transit peptides 
and transit peptide sequences can be found in Von Heijne et al. Plant Mol. Biol. Rep. 9: 104, 
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1991; Clark et al. J. Biol. Chem. 264: 17544, 1989; della-Cioppa et al. Plant Phvsiol. 84: 965, 
1987;Romeret al. Biochem. Biophvs. Res. Commun. 196: 1414, 1993; and Shah etal., 
Science 233: 478, 1986; Archer et ah, J. Bioenerg Biomembr .. 22:789, 1990; Scandalios, 
Prog. Clin. Biol. Res. 344:515, 1990; Weisbeek et al., J. Cell Sci. SuppL 11:199, 1989; 
5 Bruce, Trends Cell BioL 10:440, 2000. The present invention can utilize native or 

heterologous transit peptides. The encoding sequence for a transit peptide can include all or a 
portion of the encoding sequence for a particular transit peptide, and may also contain 
portions of the mature protein encoding sequence associated with a particular transit peptide. 
A functional portion of a plant stress-regulated plant polynucleotide, or an expression 

10 cassette, can be introduced into a cell as a naked DNA molecule, can be incorporated in a 
matrix such as a liposome or a particle such as a viral particle, or can be incorporated into a 
vector. Such vectors can be cloning or expression vectors, but other uses are within the scope 
of the present invention. A cloning vector is a self-replicating DNA molecule that serves to 
transfer a DNA segment into a host cell. The three most common types of cloning vectors 

1 5 are bacterial plasmids, phages, and other viruses. An expression vector is a cloning vector 
designed so that a coding sequence inserted at a particular site will be transcribed and 
translated into a protein. 

Incorporation of the polynucleotide into a vector can facilitate manipulation of the 
polynucleotide, or introduction of the polynucleotide into a plant cell. A vector can be 

20 derived from a plasmid or a viral vector such as a T-DNA vector (Horsch et al., Science 
227:1229-1231, 1985). If desired, the vector can comprise components of a plant 
transposable element, for example, a Ds transposon (Bancroft and Dean, Genetics 134:1221- 
1229, 1993) or an Spm transposon (Aarts et al., Mol. Gen. Genet. 247:555-564, 1995). 
In addition to containing the polynucleotide portion of a plant stress-regulated 

25 polynucleotide, a vector can contain various nucleotide sequences that facilitate, for example, 
rescue of the vector from a transformed plant cell; passage of the vector in a host cell, which 
can be a plant, animal, bacterial, or insect host cell; or expression of an encoding nucleotide 
sequence in the vector, including all or a portion of a rescued coding region. As such, the 
vector can contain any of a number of additional transcription and translation elements, 

30 including constitutive and inducible promoters, enhancers, and the like (see, for example, 
Bitter et al., Meth. Enzvmol. 153:516-544, 1987). For example, a vector can contain 
elements useful for passage, growth or expression in a bacterial system, including a bacterial 
origin of replication; a promoter, which can be an inducible promoter; and the like. In 
comparison, a vector that can be passaged in a mammalian host cell system can have a 
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promoter such as a metallothionein promoter, which has characteristics of both a constitutive 
promoter and an inducible promoter, or a viral promoter such as a retrovirus long terminal 
repeat, an adenovirus late promoter, or the like. A vector also can contain one or more 
restriction endonuclease recognition and cleavage sites, including, for example, a polylinker 
5 sequence, to facilitate rescue of a nucleotide sequence operably linked to the polynucleotide 
portion. 

The present invention also relates to a method of using a polynucleotide portion of a 
plant stress-regulated nucleotide sequence to confer a selective advantage on a plant cell. 
Such a method can be performed by introducing, for example, a plant stress-regulated 

10 regulatory element into a plant cell, wherein, upon exposure of the plant cell to a stress 
condition to which the regulatory element is responsive, a nucleotide sequence operatively 
linked to the regulatory element is expressed, thereby conferring a selective advantage to 
plant cell. The operatively linked nucleotide sequence can be a heterologous nucleotide 
sequence, which can be operatively linked to the regulatory element prior to introduction of 

15 the regulatory sequence into the plant cell; or can be an endogenous nucleotide sequence into 
which the regulatory element was targeted by a method such as homologous recombination. 
The selective advantage conferred by the operatively linked nucleotide sequence can be such 
that the plant is better able to tolerate the stress condition; or can be any other selective 
advantage. 

20 As used herein, the term "selective advantage" refers to the ability of a particular 

organism to better propagate, develop, grow, survive, or otherwise tolerate a condition as 
compared to a corresponding reference organism that does not contain a plant-stress 
regulated polynucleotide of the present invention. In one embodiment, a selective advantage 
is exemplified by the ability of a desired plant, plant cell, or the like, that contains an 

25 introduced plant stress-regulated regulatory element, to grow better than an undesired plant, 
plant cell, or the like, that does not contain the introduced regulatory element. For example, a 
recombinant polynucleotide comprising a plant stress-regulated regulatory element 
operatively linked to a heterologous nucleotide sequence encoding an enzyme that inactivates 
a herbicide can be introduced in a desired plant. Upon exposure of a mixed population of 

30 plants comprising the desired plants, which contain the recombinant polynucleotide, and one 
ormore other populations of undesired plants, which lack the recombinant polynucleotide, to 
a stress condition that induces expression of the regulatory element and to the herbicide, the 
desired plants will have a greater likelihood of surviving exposure to the toxin and, therefore, 
a selective advantage over the undesired plants. 
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In another embodiment, a selective advantage is exemplified by the ability of a 
desired plant, plant cell, or the like, to better propagate, develop, grow, survive, or otherwise 
tolerate a condition as compared to an undesired plant, plant cell, or the like, that contains an 
introduced plant stress-regulated regulatory element. For example, a recombinant 
5 polynucleotide comprising a plant stress-regulated regulatory element operatively linked to a 
plant cell toxin can be introduced into cells of an undesirable plant present in a mixed 
population of desired and undesired plants, for example, food crops and weeds, respectively, 
then the plants can be exposed to stress conditions that induce expression from the plant 
stress-regulated regulatory element, whereby expression of the plant cell toxin results in 

10 inhibition of growth or death of the undesired plants, thereby providing a selective advantage 
to the desired plants, which no longer have to compete with the undesired plants for nutrients, 
light, or the like. In another example, a plant stress-regulated regulatory element operatively 
linked to a plant cell toxin can be introduced into cells of plants used as a nurse crop. Nurse 
crops, also called cover or companion crops, are planted in combination with plants of 

15 interest to provide, among other things, shade and soil stability during establishment of the 
desired plants. Once the desired plants have become established, the presence of the nurse 
crop may no longer be desirable. Exposure to conditions inducing expression of the gene 
linked to the plant stress-regulated regulatory element allows elimination of the nurse crop. 
Alternatively nurse crops can be made less tolerant to abiotic stress by the inhibition of any 

20 of the stress-regulated sequences disclosed herein. Inhibition can be accomplished by any of 
the method described herein. Upon exposure of the nurse crop to the stress, the decreased 
ability of the nurse crop to respond to the stress will result in elimination of the nurse crop, 
leaving only the desired plants. 

The invention also provides a means of producing a transgenic plant, which comprises 

25 plant cells that exhibit altered responsiveness to a stress condition. As such, the present 
invention further provides a transgenic plant, or plant cells or tissues derived therefrom, 
which are genetically modified to respond to stress differently than a corresponding wild-type 
plant or plant not containing constructs of the present invention would respond. As used 
herein, the term "responsiveness to a stress condition" refers to the ability of a plant to 

30 express a plant stress-regulated polynucleotide upon exposure to the stress condition. A 
transgenic plant cell contains a functional portion of a plant stress-regulated polynucleotide, 
or a mutant form thereof, for example, a knock-out mutant. A knock-out mutant form of a 
plant stress-regulated nucleotide sequence can contain, for example, a mutation such that a 
STOP codon is introduced into the reading frame of the translated portion of the gene such 
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that expression of a functional stress-regulated polypeptide is prevented; or a mutation in the 
stress-regulated regulatory element such that inducibility of the element in response to a 
stress condition is inhibited. Such transgenic plants of the invention can display any of 
various idiotypic modifications is response to an abiotic stress, including altered tolerance to 
5 the stress condition, as well as increased or decreased plant growth, root growth, yield, or the 
like, as compared to the corresponding wild-type plant. 

The term "plant" is used broadly herein to include any plant at any stage of 
development, or to part of a plant, including a plant cutting, a plant cell, a plant cell culture, a 
plant organ, a plant seed, and a plantlet. A plant cell is the structural and physiological unit 

10 of the plant, comprising a protoplast and a cell wall. A plant cell can be in the form of an 
isolated single cell or a cultured cell, or can be part of higher organized unit, for example, a 
plant tissue, plant organ, or plant. Thus, a plant cell can be a protoplast, a gamete producing 
cell, or a cell or collection of cells that can regenerate into a whole plant. As such, a seed, 
which comprises multiple plant cells and is capable of regenerating into a whole plant, is 

1 5 considered plant cell for purposes of this disclosure. A plant tissue or plant organ can be a 
seed, protoplast, callus, or any other groups of plant cells that is organized into a structural or 
functional unit. Particularly useful parts of a plant include harvestable parts and parts useful 
for propagation of progeny plants. A harvestable part of a plant can be any useful part of a 
plant, for example, flowers, pollen, seedlings, tubers, leaves, stems, fruit, seeds, roots, and the 

20 like. A part of a plant useful for propagation includes, for example, seeds, fruits, cuttings, 
seedlings, tubers, rootstocks, and the like. 

A transgenic plant can be regenerated from a transformed plant cell. As used herein, 
the term "regenerate" means growing a whole plant from a plant cell; a group of plant cells; a 
protoplast; a seed; or a piece of a plant such as a callus or tissue. Regeneration from 

25 protoplasts varies from species to species of plants. For example, a suspension of protoplasts 
can be made and, in certain species, embryo formation can be induced from the protoplast 
suspension, to the stage of ripening and germination. The culture media generally contains 
various components necessary for growth and regeneration, including, for example, 
hormones such as auxins and cytokinins; and amino acids such as glutamic acid and proline, 

30 depending on the particular plant species. Efficient regeneration will depend, in part, on the 
medium, the genotype, and the history of the culture. If these variables are controlled, 
however, regeneration is reproducible. 

Regeneration can occur from plant callus, explants, organs or plant parts. 
Transformation can be performed in the context of organ or plant part regeneration, (see 
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Meth. Enzvmol. Vol. 1 1 8; Klee et al. Ann. Rev. Plant Phvsiol. 38:467 (1987)). Utilizing the 
leaf disk-transformation-regeneration method, for example, disks are cultured on selective 
media, followed by shoot formation in about two to four weeks (see Horsch et aJ., Science 
227:1229, 1985). Shoots that develop are excised from calli and transplanted to appropriate 
5 root-inducing selective medium. Rooted plantlets are transplanted to soil as soon as possible 
after roots appear. The plantlets can be repotted as required, until reaching maturity. 

In vegetatively propagated crops, the mature transgenic plants are propagated utilizing 
cuttings or tissue culture techniques to produce multiple identical plants. Selection of 
desirable transgenotes is made and new varieties are obtained and propagated vegetatively for 

10 commercial use. In seed propagated crops, the mature transgenic plants can be self crossed to 
produce a homozygous inbred plant. The resulting inbred plant produces seeds that contain 
the introduced plant stress-induced regulatory element, and can be grown to produce plants 
that express a polynucleotide or polypeptide in response to a stress condition that induces 
expression from the regulatory element. As such, the invention further provides seeds 

1 5 produced by a transgenic plant obtained by a method of the invention. 

In addition, transgenic plants comprising different recombinant sequences can be 
crossbred, thereby providing a means to obtain transgenic plants containing two or more 
different transgenes, each of which contributes a desirable characteristic to the plant. 
Methods for breeding plants and selecting for crossbred plants having desirable 

20 characteristics or other characteristics of interest are well known in the art. 

A method of the invention can be performed by introducing a functional portion of a 
plant stress-regulated nucleotide sequence into the plant. As used herein, the term 
"introducing" means transferring a polynucleotide into a plant cell. A polynucleotide can be 
introduced into a cell by a variety of methods well known to those of ordinary skill in the art. 

25 For example, the polynucleotide can be introduced into a plant cell using a direct gene 

transfer method such as electroporation or microprojectile mediated transformation, or using 
Agrobacterium mediated transformation. Non-limiting examples of methods for the 
introduction of polynucleotides into plants are provided in greater detail herein. As used 
herein, the term "transformed" refers to a plant cell containing an exogenously introduced 

30 polynucleotide portion of a plant stress-regulated nucleotide sequence that is or can be 
rendered active in a plant cell, or to a plant comprising a plant cell containing such a 
polynucleotide. 

It should be recognized that one or more polynucleotides, which are the same or 
different can be introduced into a plant, thereby providing a means to obtain a genetically 
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modified plant containing multiple copies of a single transgenic sequence, or containing two 
or more different transgenic sequences, either or both of which can be present in multiple 
copies. Such transgenic plants can be produced, for example, by simply selecting plants 
having multiple copies of a single type of transgenic sequence; by co-transfecting plant cells 
5 with two or more populations of different transgenic sequences and identifying those 
containing the two or more different transgenic sequences; or by crossbreeding transgenic 
plants, each of which contains one or more desired transgenic sequences, and identifying 
those progeny having the desired sequences. 

Methods for introducing a polynucleotide into a plant cell to obtain a transformed 

10 plant also include direct gene transfer (see European Patent A 164 575), injection, 
electroporation, biolistic methods such as particle bombardment, pollen-mediated 
transformation, plant RNA virus-mediated transformation, liposome-mediated 
transformation, transformation using wounded or enzyme-degraded immature embryos, or 
wounded or enzyme-degraded embryogenic callus, and the like. Transformation methods 

1 5 using Agrobacterium twnefaciens tumor inducing (Ti) plasmids or root-inducing (Ri) 

plasmids, or plant virus vectors are well known in the art (see, for example, WO 99/47552; 
Weissbach & Weissbach, "Methods for Plant Molecular Biology", Academic Press, 1988), 
section Vm, pages 421-463; Grierson and Corey, "Plant Molecular Biology" 2d Ed. Blackie, 
London, 1988), Chapters 7-9; Horsch et ah, Science 227:1229, 1985). The wild-type form of 

20 Agrobacterium, for example, contains a Ti plasmid, which directs production of tumorigenic 
crown gall growth on host plants. Transfer of the tumor inducing T-DNA region of the Ti 
plasmid to a plant genome requires the Ti plasmid-encoded virulence genes as well as T-DNA 
borders, which are a set of direct DNA repeats that delineate the region to be transferred. An 
Agrobacterium based vector is a modified form of a Ti plasmid, in which the tumor inducing 

25 functions are replaced by a nucleotide sequence of interest that is to be introduced into the 
plant host. 

Methods of using Agrobacterium mediated transformation include cocultivation of 
Agrobacterium with cultured isolated protoplasts; transformation of plant cells or tissues with 
Agrobacterium; and transformation of seeds, apices or meristems with Agrobacterium, In 
30 addition, in planta transformation by Agrobacterium can be performed using vacuum 
infiltration of a suspension of Agrobacterium cells (Bechtold et al., C.R. Acad. Sci. Paris 
316:1194,1993). 

Agrobacterium mediated transformation can employ cointegrate vectors or binary 
vector systems, in which the components of the Ti plasmid are divided between a helper 
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vector, which resides permanently in the Agrobacterium host and carries the virulence genes, 
and a shuttle vector, which contains the gene of interest bounded by T-DNA sequences. 
Binary vectors are well known in the art (see, for example, DeFramond, BioTechnology 
1 :262, 1983; Hoekema et al., Nature 303:179, 1983) and are commercially available 
5 (Clontech; Palo Alto CA). For transformation, Agrobacterium can be cocultured, for 
example, with plant cells or wounded tissue such as leaf tissue, root explants, 
hypocotyledons, stem pieces or tubers (see, for example, Glick and Thompson, "Methods in 
Plant Molecular Biology and Biotechnology" ,Boca Raton FL, CRC Press, 1993). Wounded 
cells within the plant tissue that have been infected by Agrobacterium can develop organs de 

10 novo when cultured under the appropriate conditions; the resulting transgenic shoots 

eventually give rise to transgenic plants, which contain an exogenous polynucleotide portion 
of a plant stress-regulated nucleotide sequence. 

Agrobacterium mediated transformation has been used to produce a variety of 
transgenic plants, including, for example, transgenic cruciferous plants such as Arabidopsis, 

15 mustard, rapeseed and flax; transgenic leguminous plants such as alfalfa, pea, soybean, trefoil 
and white clover; and transgenic solanaceous plants such as eggplant, petunia, potato, 
tobacco and tomato (see, for example, Wang et al., "Transformation of Plants and Soil 
Microorganisms", Cambridge University Press, 1995). In addition, Agrobacterium mediated 
transformation can be used to introduce an exogenous polynucleotide sequence, for example, 

20 a plant stress-regulated regulatory element into apple, aspen, belladonna, black currant, 
carrot, celery, cotton, cucumber, grape, horseradish, lettuce, morning glory, muskmelon, 
neem, poplar, strawberry, sugar beet, sunflower, walnut, asparagus, rice and other plants (see, 
for example, Glick and Thompson, supra, 1993; Hiei et al., Plant J. 6:271-282, 1994; 
Shimamoto, Science 270:1772-1773, 1995). 

25 Suitable strains of Agrobacterium tumefaciens and vectors as well as transformation 

of Agrobacteria and appropriate growth and selection media are well known in the art 
(GV3101, pMK90RK), Koncz, Mol. Gen. Genet. 204:383-396, 1986; (C58C1, 
pGV3850kan), Deblaere, Nucl. Acid Res. 13:4777, 1985; Bevan, Nucleic Acid Res. 12:8711, 
1984; Koncz, Proc. Natl Acad. Sci. USA 86:8467-8471, 1986; Koncz, Plant Mol. Biol. 

30 20:963-976, 1992; Koncz, "Specialized vectors for gene tagging and expression studies", in: 
Plant Molecular Biology Manual Vol. 2, Gelvin and Schilperoort (Eds.), Dordrecht, The 
Netherlands: Kluwer Academic Publ. (1994), 1-22; European Patent A-l 20 516; Hoekema, 
"The Binary Plant Vector System", Offsetdrukkerij Kanters B. V., Alblasserdam (1985), 
Chapter V; Fraley, Crit. Rev. Plant. Sci. , 4:1-46; An, EMBOJ. 4:277-287, 1985). 
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Where a polynucleotide portion of a plant stress-regulated nucleotide sequence is 
contained in vector, the vector can contain functional elements, for example "left border" and 
"right border" sequences of the T-DNA of Agrobacterium, which allow for stable integration 
into a plant genome. Furthermore, methods and vectors that permit the generation of 

5 marker-free transgenic plants, for example, where a selectable marker gene is lost at a certain 
stage of plant development or plant breeding, are known, and include, for example, methods 
of co-transformation (Lyznik, Plant Mol. Biol. 13:151-161, 1989; Peng, Plant Mol. Biol. 
27:91- 104, 1995), or methods that utilize enzymes capable of promoting homologous 
recombination in plants (see, e.g., W097/08331; Bayley, Plant Mol. Biol. 18:353-361, 1992; 

10 Lloyd, Mol. Gen. Genet. 242:653-657, 1994; Maeser, Mol. Gen. Genet. 230:170-176, 1991; 
Onouchi, Nucl. Acids Res. 19:6373-6378, 1991; see, also, Sambrook et al., supra, 1989). 

A direct gene transfer method sucfras elcctroporation also can be used to introduce a 
polynucleotide portion of a plant stress-regulated nucleotide sequence into a cell such as a 
plant cell. For example, plant protoplasts can be electroporated in the presence of the 

15 regulatory element, which can be in a vector (Fromm.et al., Proc. Natl. Acad. Sci.. USA 
82:5824, 1985). Electrical impulses of high field strength reversibly permeabilize 
membranes allowing the introduction of the nucleic acid. Electroporated plant protoplasts 
reform the cell wall, divide and form a plant callus. Microinjection can be performed as 
described in Potrykus and Spangenberg (eds.), Gene Transfer To Plants, Springer Verlag, 

20 Berlin, NY, 1995. A transformed plant cell containing the introduced polynucleotide can be 
identified by detecting a phenotype due to the introduced polynucleotide, for example, 
increased or decreased tolerance to a stress condition. 

Microprojectile mediated transformation also can be used to introduce a 
polynucleotide into a plant cell (Klein et al, Nature 327:70-73 (1987)). This method utilizes 

25 microprojectiles such as gold or tungsten, which are coated with the desired nucleic acid 
molecule by precipitation with calcium chloride, spermidine or polyethylene glycol. The 
microprojectile particles are accelerated at high speed into a plant tissue using a device such 
as the BIOLISTIC PD-1000 (Biorad; Hercules CA). 

Microprojectile mediated delivery ("particle bombardment") is especially useful to 

30 transform plant cells that are difficult to transform or regenerate using other methods. 

Methods for the transformation using biolistic methods are well known (Wan, Plant Physiol. 
104:37-48, 1984; Vasil. Bio/Technology 11:1553-1558, 1993; Christou, Trends m Plant 
Science 1:423-431, 1996). Microprojectile mediated transformation has been used, for 
example, to generate a variety of transgenic plant species, including cotton, tobacco, corn, 



WO 03/008540 PCT/US02/19668 

55 

hybrid poplar and papaya (see Glick and Thompson, supra, 1 993). Important cereal crops 
such as wheat, oat, barley, sorghum and rice also have been transformed using 
microprojectile mediated delivery (Duan et ah, Nature Biotech. 14:494-498, 1996; 
Shimamoto, Curr. Qpin. Biotech. 5:158-162, 1994). A rapid transformation regeneration 
5 system for the production of transgenic plants such as a system that produces transgenic 

wheat in two to three months (see European Patent No. EP 0709462A2) also can be useful for 
producing a transgenic plant using a method of the invention, thus allowing more rapid 
identification of gene functions. The transformation of most dicotyledonous plants is 
possible with the methods described above. Transformation of monocotyledonous plants also 

10 can be transformed using, for example, biolistic methods as described above, protoplast 
transformation, electroporation of partially permeabilized cells, introduction of DNA using 
glass fibers, Agrobacterium mediated transformation, and the like. 

Plastid transformation also can be used to introduce a polynucleotide portion of a 
plant stress-regulated nucleotide sequence into a plant cell (U.S. Patent Nos. 5,451,513, 

15 5,545,817, and 5,545,818; WO 95/16783; McBride et al M Proc. Natl. Acad. Sci.. USA 
91:7301-7305, 1994). Chloroplast transformation involves introducing regions of cloned 
plastid DNA flanking a desired nucleotide sequence, for example, a selectable marker 
together with polynucleotide of interest into a suitable target tissue, using, for example, a 
biolistic or protoplast transformation method (e.g., calcium chloride or PEG mediated 

20 transformation). One to 1.5 kb flanking regions ("targeting sequences") facilitate 
homologous recombination with the plastid genome, and allow the replacement or 
modification of specific regions of the plastome. Using this method, point mutations in the 
chloroplast 16S rRNA and rpsl2 genes, which confer resistance to spectinomycin and 
streptomycin, can be utilized as selectable markers for transformation (Svab et al. 9 Proc. Natl. 

25 Acad. ScL USA 87:8526-8530, 1990; Staub and Maliga, Plant Cell 4:39-45, 1992), resulted 
in stable homopiasmic transformants; at a frequency of approximately one per 100 
bombardments of target leaves. The presence of cloning sites between these markers allowed 
creation of a plastid targeting vector for introduction of foreign genes (Staub and Maliga, 
EMBO J. 12:601-606, 1993). Substantial increases in transformation frequency are obtained 

30 by replacement of the recessive rRNA or r-protein antibiotic resistance genes with a 
dominant selectable marker, the bacterial aadA gene encoding the spectinomycin- 
detoxifying enzyme aminoglycoside-3'-adenyltransf erase (Svab and Maliga, Proc. NatL 
Acad. ScL USA 90:913-917, 1993). Approximately 1 5 to 20 cell division cycles following 
transformation are generally required to reach a homoplastidic state. Plastid expression, in 
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which genes are inserted by homologous recombination into all of the several thousand 
copies of the circular plastid genome present in each plant cell, takes advantage of the 
enormous copy number advantage over nuclear-expressed genes to permit expression levels 
that can readily exceed 1 0% of the total soluble plant protein. 
5 Plants suitable to treatment according to a method of the invention can be monocots 

or dicots and include, but are not limited to, com (Zea mays), Brassica sp. (e.g., B. napus, B. 
rapa, B.juncea), particularly those Brassica species useful as sources of seed oil, alfalfa 
(Medicago saliva), rice (Oryza saliva), rye (Secale cereale), sorghum (Sorghum bicolor, 
Sorghum vulgare), millet (e.g., pearl millet (Penniselum glaucum), proso millet (Panicum 

10 miliaceum), foxtail millet (Selaria italica), finger millet (Eleusine coracana)), sunflower 
(Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean 
(Glycine max), tobacco (Nicotiana tabacum), potato (Solatium tuberosum), peanuts (Arachis 
hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea 
batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), 

1 5 pineapple (Ananas comosus), citrus trees (Cilrus spp.), cocoa (Theobroma cacao), tea 

(Camellia sinensis), banana (Musa spp.), avocado (Persea ullilane), fig (Ficus casica), guava 
(Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica 
papaya), cashew (Anacardium occidental), macadamia (Macadamia integrifolia), almond 
(Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, 

20 duckweed (Lemna), barley, tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca 

sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus 
spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. 
cantalupensis), and musk melon (C. melo). 

Ornamentals such as azalea (Rhododendron spp.), hydrangea (Macrophylla 

25 hydrangea), hibiscus (Hibiscus rosasanensis), roses (Rosa spp.), tulips (Tulipa spp.), 

daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), 
poinsettia (Euphorbia pulcherrima), and chrysanthemum are also included. Additional 
ornamentals within the scope of the invention include impatiens, Begonia, Pelargonium, 
Viola, Cyclamen, Verbena, Vinca, Tagetes, Primula, Saint Paulia, Agertum, Amaranthus, 

30 Antihirrhinum, Aquilegia, Cineraria, Clover, Cosmo, Cowpea, Dahlia, Datura, Delphinium, 
Gerbera, Gladiolus, Gloxinia, Hippeastrum, Mesembryanthemum, Salpiglossos, and Zinnia. 

Conifers that may be employed in practicing the present invention include, for 
example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliotii), ponderosa pine 
(Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata), 
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Douglas-fir (Pseudotsuga menziesii); Western hemlock (Tsuga ultilane); Sitka spruce (Picea 
glauca)\ redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabiiis) and 
balsam fir (Abies balsamea)\ and cedars such as Western red cedar (Thuja plicata) and 
Alaska yellow-cedar (Chamaecyparis nootkatensis). 
5 Leguminous plants which may be used in the practice of the present invention include 

beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, 
mungbean, lima bean, fava bean, lentils, chickpea, etc. Legumes include, but are not limited 
to, Arachis, e.g., peanuts, Vicia, e.g., crown vetch, hairy vetch, adzuki bean, mung bean, and 
chickpea, Lupinus, e.g., lupine, trifolium, Phaseolus, e.g., common bean and lima bean, 
10 Pisum, e.g., field bean, Melilotus, e.g., clover, Medicago, e.g., alfalfa, Lotus, e.g., trefoil, 
lens, e.g., lentil, and false indigo. Forage and turf grass for use in the methods of the 
invention include alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass, 
and redtop. 

Other plants within the scope of the invention include Acacia, aneth, artichoke, arugula, 
15 blackberry, canola, cilantro, Clementines, escarole, eucalyptus, fennel, grapefruit, honey dew, 

jicama, kiwifruit, lemon, lime, mushroom, nut, okra, orange, parsley, persimmon, plantain, 

pomegranate, poplar, radiata pine, radicchio, Southern pine, sweetgum, tangerine, triticale, 

vine, yams, apple, pear, quince, cherry, apricot, melon, hemp, buckwheat, grape, raspberry, 

chenopodium, blueberry, nectarine, peach, plum, strawberry, watermelon, eggplant, pepper, 
20 cauliflower, Brassica, e.g., broccoli, cabbage, ultilan sprouts, onion, carrot, leek, beet, broad 

bean, celery, radish, pumpkin, endive, gourd, garlic, snapbean, spinach, squash, turnip, 

ultilane, chicory, groundnut and zucchini. 

Angiosperms are divided into two broad classes based on the number of cotyledons, 

which are seed leaves that generally store or absorb food; a monocotyledonous angiosperm 
25 has a single cotyledon, and a dicotyledonous angiosperm has two cotyledons. Angiosperms 

produce a variety of useful products including materials such as lumber, rubber, and paper; 

fibers such as cotton and linen; herbs and medicines such as quinine and vinblastine; 

ornamental flowers such as roses and orchids; and foodstuffs such as grains, oils, fruits and 

vegetables. 

30 Angiosperms encompass a variety of flowering plants, including, for example, cereal 

j>lants, leguminous plants, oilseed plants, hardwood trees, fruit-bearing plants and ornamental 
flowers, which general classes are not necessarily exclusive. Cereal plants, which produce an 
edible grain cereal and are suitable for use in the present invention, include, for example, 
com, rice, wheat, barley, oat, rye, millet, orchardgrass, guinea grass, sorghum and turfgrass. 
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Leguminous plants include members of the pea family (Fabaceae) and produce a 
characteristic fruit known as a legume. Examples of leguminous plants include, for example, 
soybean, pea, chickpea, moth bean, broad bean, kidney bean, lima bean, lentil, cowpea, dry 
bean, and peanut, as well as alfalfa, birdsfoot trefoil, clover and sainfoin. Oilseed plants, 
5 which have seeds that are useful as a source of oil, include soybean, sunflower, rapeseed 
(canola) and cottonseed. 

Angiosperms also include hardwood trees, which are perennial woody plants that 
generally have a single stem (trunk). Examples of such trees include alder, ash, aspen, 
basswood (linden), beech, birch, cherry, cottonwood, elm, eucalyptus, hickory, locust, maple, 

10 oak, persimmon, poplar, sycamore, walnut, sequoia, and willow. Trees are useful, for 
example, as a source of pulp, paper, structural material and fuel. 

Angiosperms are fruit-bearing plants that produce a mature, ripened ovary, which 
generally contains seeds. A fruit can be suitable for human or animal consumption or for 
collection of seeds to propagate the species. For example, hops are a member of the 

1 5 mulberry family that are prized for their flavoring in malt liquor. Fruit-bearing angiosperms 
also include grape, orange, lemon, grapefruit, avocado, date, peach, cherry, olive, plum, 
coconut, apple and pear trees and blackberry, blueberry, raspberry, strawberry, pineapple, 
tomato, cucumber and eggplant plants. An ornamental flower is an angiosperm cultivated for 
its decorative flower. Examples of commercially important ornamental flowers include rose, 

20 orchid, lily, tulip and chrysanthemum, snapdragon, camellia, carnation and petunia plants. 
The skilled artisan will recognize that the methods of the invention can be practiced using 
these or other angiosperms, as desired, as well as gymnosperms, which do not produce seeds 
in a fruit. 

A method of producing a transgenic plant can be performed by introducing a functional 
25 portion of plant stress-regulated polynucleotide into a plant cell genome, whereby the 

functional portion of the plant stress-regulated polynucleotide modulates a response of the 
plant cell to a stress condition, thereby producing a transgenic plant, which comprises plant 
cells that exhibit altered responsiveness to the stress condition. In one embodiment, the 
functional portion of the plant stress-regulated polynucleotide encodes a stress-regulated 
30 polypeptide or functional peptide portion thereof, wherein expression of the stress-regulated 
polypeptide or functional peptide portion thereof either increases the stress tolerance of the 
transgenic plant, or decreases the stress tolerance of the transgenic plant. The functional 
portion of the plant stress-regulated nucleotide sequence encoding the stress-regulated 
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polypeptide or functional peptide portion thereof can be operatively linked to a heterologous 
promoter. 

In another embodiment, the polynucleotide portion of the plant stress-regulated 
nucleotide sequence comprises a stress-regulated regulatory element. The stress-regulated 
5 regulatory element can integrate into the plant cell genome in a site-specific manner, 

whereupon it can be operatively linked to an endogenous nucleotide sequence, which can be 
expressed in response to a stress condition specific for the regulatory element; or can be a 
mutant regulatory element, which is not responsive to the stress condition, whereby upon 
integrating into the plant cell genome, the mutant regulatory element disrupts an endogenous 

10 stress-regulated regulatory element of a plant stress-regulated nucleotide sequence, thereby 
altering the responsiveness of the plant stress-regulated nucleotide sequence to the stress 
condition. Accordingly, the invention also provides genetically modified plants, including 
transgenic plants, produced by such a method, and a plant cell obtained from such genetically 
modified plant, wherein said plant cell exhibits altered responsiveness to the stress condition; 

1 5 a seed produced by a transgenic plant; and a cDNA library prepared from a transgenic plant. 

Also provided is a method of modulating the responsiveness of a plant cell to a stress 
condition. Such a method can be performed, for example, by introducing a functional portion 
of a plant stress-regulated nucleotide sequence into the plant cell, thereby modulating the 
responsiveness of the plant cell to a stress condition. As disclosed herein, the responsiveness 

20 of the plant cell can be increased or decreased upon exposure to the stress condition, and the 
altered responsiveness can result in increased or decreased tolerance of the plant cell to a 
stress condition. The functional portion of the plant stress-regulated polynucleotide can, but 
need not, be integrated into the genome of the plant cell, thereby modulating the 
responsiveness of the plant cell to the stress condition. Accordingly, the invention also 

25 provide a genetically modified plant, including a transgenic plant, which contains an 

introduced polynucleotide portion of a plant stress-regulated nucleotide sequence, as well as 
plant cells, tissues, and the like, which exhibit modulated responsiveness to a stress 
condition. 

The functional portion of the plant stress-regulated polynucleotide can encode a 
30 stress-regulated polypeptide or functional peptide portion thereof, which can be operatively 
linked to a heterologous promoter. As used herein, reference to a "functional peptide portion 
of a plant stress-regulated polypeptide" means a contiguous amino acid sequence of the 
polypeptide that has at least 50%, at least 75%, at least 90%, or at least 95% the activity of 
the full length polypeptide, or that has an antagonist activity with respect to the fiill length 
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polypeptide, or that presents an epitope unique to the polypeptide. Thus, by expressing a 
functional peptide portion of a plant stress-regulated polypeptide in a plant cell, the peptide 
can act as an agonist or an antagonist of the polypeptide, thereby modulating the 
responsiveness of the plant cell to a stress condition. It should be noted that while the 
5 functional peptide portion has an activity of the full length polypeptide, the activity need not 
be of the same magnitude. Thus, the functional peptide portion can have an activity with is 
greater or lesser in magnitude than the full length polypeptide. 

A functional portion of the plant stress-regulated polynucleotide also can contain a 
mutation, whereby upon integrating into the plant cell genome, the polynucleotide disrupts 

1 0 (knocks-out) an endogenous plant stress-regulated nucleotide sequence, thereby modulating 
the responsiveness of said plant cell to the stress condition. Depending on whether the 
knocked-out gene encodes an adaptive or a maladaptive stress-regulated polypeptide, the 
responsiveness of the plant will be modulated accordingly. Thus, a method of the invention 
provides a means of producing a transgenic plant having a knock-out phenotype of a plant 

1 5 stress-regulated nucleotide sequence. 

Alternatively, the responsiveness of a plant or plant cell to a stress condition can be 
modulated by use of a suppressor construct containing dominant negative mutation for any of 
the stress-regulated polynucleotides described herein. Expression of a suppressor construct 
containing a dominant mutant mutation generates a mutant transcript that, when coexpressed 

20 with the wild-type transcript inhibits the action of the wild-type transcript. Methods for the 
design and use of dominant negative constructs are well known in the art and can be found, 
for example, in Herskowitz, Nature , 329:219-222. 1987 and Lagna and Hemmati-Brivanlou, 
Curr. Topics Devel. Biol .. 36:75-98, 1998. 

The functional portion of the plant stress-regulated polynucleotide sequence also can 

25 comprise a stress-regulated regulatory element, which can be operatively linked to a 

heterologous nucleotide sequence, which, upon expression from the regulatory element in 
response to a stress condition, modulates the responsiveness of the plant cell to the stress 
condition. Such a heterologous nucleotide sequence can encode, for example, a stress- 
inducible transcription factor such as DREB1 A, which, upon exposure to the stress condition, 

30 is expressed such that it can amplify the stress response (see Kasuga et al., Nat. BiotechnoL 
_ 17:287-291, 1999). The heterologous nucleotide sequence also can encode a polynucleotide 
that is specific for a plant stress-regulated nucleotide sequence, for example, an antisense 
molecule, a ribozyme, and a triplexing agent, either of which, upon expression in the plant 
cell, reduces or inhibits expression of a stress-regulated polypeptide encoded by the gene, 
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thereby modulating the responsiveness of the plant cell to a stress condition, for example, an 
abnormal level of cold, osmotic pressure, and salinity. As used herein, the term "abnormal," 
when used in reference to a condition such as temperature, osmotic pressure, salinity, or any 
other condition that can be a stress condition, means that the condition varies sufficiently 
5 from a range generally considered optimum for growth of a plant that the condition results in 
an induction of a stress response in a plant. Methods of determining whether a stress 
response has been induced in a plant are disclosed herein or otherwise known in the art. 

A plant stress-regulated regulatory element can be operatively linked to a heterologous 
polynucleotide sequence, such that the regulatory element can be introduced into a plant genome 

10 in a site-specific matter by homologous recombination. For example, a mutant plant stress- 
regulated regulatory element for a maladaptive stress-induced polypeptide can be 
transformed into a plant genome in a site specific manner by in vivo mutagenesis, using a 
hybrid RNA-DNA oligonucleotide ("chimeroplast" (TIBTECH 15:441- 447, 1997; 
W0 95/15972; Kren, Hepatology 25:1462-1468, 1997; Cole-Strauss, Science 273:1386-1389, 

15 1996). Part of the DNA component of the RNA-DNA oligonucleotide is homologous to a 
nucleotide sequence comprising the regulatory element of the maladaptive gene, but includes 
a mutation or contains a heterologous region which is surrounded by the homologous regions. 
By means of base pairing of the homologous regions of the RNA-DNA oligonucleotide and 
of the endogenous nucleic acid molecule, followed by a homologous recombination the 

20 mutation contained in the DNA component of the RNA-DNA oligonucleotide or the 

heterologous region can be transferred to the plant genome, resulting in a "mutant" gene that, 
for example, is not induced in response to a stress and, therefore, does not confer the 
maladaptive phenotype. Such a method similarly can be used to knock-out the activity of a 
stress-regulated nucleotide sequence, for example, in an undesirable plant. Such a method 

25 can provide the advantage that a desirable wild-type plant need not compete with the 
undesirable plant, for example, for light, nutrients, or the like. 

A method of modulating the responsiveness of a plant cell to a stress condition also 
can be performed by introducing a mutation in the chromosomal copy of a plant stress- 
regulated nucleotide sequence, for example, in the stress-regulated regulatory element, by 

30 transforming a cell with a chimeric oligonucleotide composed of a contiguous stretch of 

RNA and DNA residues in a duplex conformation with double hairpin caps on the ends. An 
additional feature of the oligonucleotide is the presence of 2'-0- methylation at the RNA 
residues. The RNA/DNA sequence is designed to align with the sequence of a chromosomal 
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copy of the target regulatory element and to contain the desired nucleotide change (see U.S. 
Patent No. 5,501,967). 

A plant stress-regulated regulatory element also can be operatively linked to a 
heterologous polynucleotide such that, upon expression from the regulatory element in the plant 
5 cell, confers a desirable phenotype on the plant cell. For example, the heterologous 
polynucleotide can encode an aptamer; which can bind to a stress-induced polypeptide. 
Aptamers are nucleic acid molecules that are selected based on their ability to bind to and 
inhibit the activity of a protein or metabolite. Aptamers can be obtained by the SELEX 
(Systematic Evolution of Ligands by Exponential Enrichment) method (see U.S. Patent No. 

10 5,270,163), wherein a candidate mixture of single stranded nucleic acids having regions of 
randomized sequence is contacted with a target, and those nucleic acids having a specific 
affinity to the target are partitioned from the remainder of the candidate mixture, and 
amplified to yield a ligand enriched mixture. After several iterations a nucleic acid molecule 
(aptamer) having optimal affinity for the target is obtained. For example, such a nucleic acid 

1 5 molecule can be operatively linked to a plant stress-regulated regulatory element and 
introduced into a plant. Where the aptamer is selected for binding to a polypeptide that 
normally is expressed from the regulatory element and is involved in an adaptive response of 
the plant to a stress, the recombinant molecule comprising the aptamer can be useful for 
inhibiting the activity of the stress-regulated polypeptide, thereby decreasing the tolerance of 

20 the plant to the stress condition. 

The invention provides a genetically modified plant, which can be a transgenic plant, 
that is tolerant or resistant to a stress condition. As used herein, the term "tolerant" or 
"resistant," when used in reference to a stress condition of a plant, means that the particular 
plant, when exposed to a stress condition, shows less of an effect, or no effect, in response to 

25 the condition as compared to a corresponding reference plant (naturally occurring wild-type 
plant or a plant not containing a construct of the present invention). As a consequence, a 
plant encompassed within the present invention grows better under more widely varying 
conditions, has higher yields and/or produces more seeds. Thus, a transgenic plant produced 
according to a method of the invention can demonstrate protection (as compared to a 

30 corresponding reference plant) from a delay to complete inhibition of alteration in cellular 
metabolism, or reduced cell growth or cell death caused by the stress. Preferably, the 
transgenic plant is capable of substantially normal growth under environmental conditions 
where the corresponding reference plant shows reduced growth, metabolism or viability, or 
increased male or female sterility. 
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The determination that a plant modified according to a method of the invention has 
increased resistance to a stress-inducing condition can be made by comparing the treated 
plant with a control (reference) plant using well known methods. For example, a plant 
having increased tolerance to saline stress can be identified by growing the plant on a 
5 medium such as soil, which contains a higher content of salt in the order of at least about 
10% compared to a medium the corresponding reference plant is capable of growing on. 
Advantageously, a plant treated according to a method of the invention can grow on a 
medium or soil containing at least about 50%, or more than about 75%, or more than about 
100%, or more than about 200% salt than the medium or soil on which a corresponding 

10 reference plant can grow. In particular, such a treated plant can grow on medium or soil 
containing at least 40 mM, at least 100 rnM, at least 200 mM, or at least 300 mM salt, 
including, for example, a water soluble inorganic salt such as sodium sulfate, magnesium 
sulfate, calcium sulfate, sodium chloride, magnesium chloride, calcium chloride, potassium 
chloride, or the like; salts of agricultural fertilizers, and salts associated with alkaline or acid 

1 5 soil conditions; particularly NaCl. 

In another embodiment, the invention provides a plant that is less tolerant or less 
resistant to a stress condition as compared to a corresponding reference plant. As used 
herein, the term "less tolerant" or "less resistant," when used in reference to a stress condition 
of a plant, means that the particular plant, when exposed to a stress condition, shows an 

20 alteration in response to the condition as compared to a corresponding reference plant. In one 
embodiment, the alteration is response is at least 5%, in another at least 10% and in still 
another at least 25% when compared to the reference plant. As a consequence, such a plant, 
which generally is an undesirable plant species, is less likely to grow when exposed to a 
stress condition than an untreated plant. 

25 The present invention also relates to a method of expressing a heterologous nucleotide 

sequence in a plant cell. Such a method can be performed, for example, by introducing into 
the plant cell a plant stress-regulated regulatory element operatively linked to the 
heterologous nucleotide sequence, whereby, upon exposure of the plant cell to stress 
condition, the heterologous nucleotide sequence is expressed in the plant cell. The 

30 heterologous nucleotide sequence can encode a selectable marker, or a polypeptide that 
confers a desirable trait upon the plant cell, for example, a polypeptide that improves the 
nutritional value, digestibility or ornamental value of the plant cell, or a plant comprising the 
plant cell. Accordingly, the invention provides a transgenic plant that, in response to a stress 
condition, can produce a heterologous polypeptide from a plant stress-regulated regulatory 
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element. Such transgenic plants can provide the advantage that, when grown in a cold 
environment for example, expression of the heterologous polypeptide from a plant cold- 
regulated regulatory element can result in increased nutritional value of the plant. 

The present invention further relates to a method of modulating the activity of a 
5 biological pathway in a plant cell, wherein the pathway involves a stress-regulated 
polypeptide. As used herein, reference to a pathway that "involves" a stress-regulated 
polypeptide means that the polypeptide is required for normal function of the pathway. For 
example, plant stress-regulated polypeptides as disclosed herein include those acting as 
kinases or as transcription factors, which are well known to be involved in signal 

10 transduction pathways. As such, a method of the invention provides a means to modulate 
biological pathways involving plant stress-regulated polypeptides, for example, by altering 
the expression of the polypeptides in response to a stress condition. Thus, a method of the 
invention can be performed, for example, by introducing a polynucleotide portion of a plant 
stress-regulated nucleotide sequence into the plant cell, thereby modulating the activity of the 

1 5 biological pathway. A method of the invention can be performed with respect to a pathway 
involving any of the stress-regulated polypeptides as encoded by a polynucleotide of 
disclosed herein, including for example, a stress-regulated transcription factor, an enzyme, 
including a kinase, a channel protein, or the like. 

The present invention also relates to a method of identifying a polynucleotide that . 

20 modulates a stress response in a plant cell. Such a method can be performed, for example, by 
contacting an array of probes representative of a plant cell genome and nucleic acid 
molecules expressed in plant cell exposed to the stress; detecting a nucleic acid molecule that 
is expressed at a level different from a level of expression in the absence of the stress; 
introducing the nucleic acid molecule that is expressed differently into a plant cell; and 

25 detecting a modulated response of the plant cell containing the introduced nucleic acid 

molecule to a stress, thereby identifying a polynucleotide that modulates a stress response in 
a plant cell. The contacting is under conditions that allow for specific hybridization of a 
nucleic acid molecule with probe having sufficient complementarity, for example, under 
stringent or highly stringent, hybridization conditions. 

30 As used herein, the term "array of probes representative of a plant cell genome" 

means an organized group of oligonucleotide probes that are linked to a solid support, for 
example, a microchip or a glass slide, wherein the probes can hybridize specifically and 
selectively to nucleic acid molecules expressed in a plant cell. Such an array is exemplified 
herein by a GeneChip® Arabidopsis Genome Array (Affymetrix; see Examples). In general, an 
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array of probes that is "representative" of a plant genome will identify at least about 30% of 
the expressed nucleic acid molecules in a plant cell, at least about 50% or 70%, at least about 
80% or 90%, or will identify all of the expressed nucleic acid molecules. It should be 
recognized that the greater the representation, the more likely all nucleotide sequences of 
5 cluster of stress-regulated nucleotide sequences will be identified. 

In addition, any polynucleotide of the present disclosure can be used for diagnostic 
purposes or to find related stress-responsive sequences in other species. Any polynucleotide 
provided herein may be attached in overlapping areas or at random locations on the solid 
support. Alternatively the polynucleotides of the invention may be attached in an ordered 

10 array wherein each polynucleotide is attached to a distinct region of the solid support that 
does not overlap with the attachment site of any other polynucleotide. In one instance, such 
an ordered array of polynucleotides is designed to be "addressable" where the distinct 
locations are recorded and can be accessed as part of an assay procedure. Addressable 
polynucleotide arrays typically include a plurality of different oligonucleotide probes that are 

15 coupled to a surface of a substrate in different known locations. The knowledge of the 
precise location of each polynucleotides location makes these "addressable" arrays 
particularly useful in hybridization assays. Any addressable array technology known in the 
art can be employed with the polynucleotides of the invention. One particular embodiment 
of these polynucleotide arrays is known as the Genechips™, and has been generally described 

20 in US Patent 5,143,854 and PCT publications WO 90/1 5070 and 92/10092. These arrays 
may generally be produced using mechanical synthesis methods or light directed synthesis 
methods that incorporate a combination of photolithographic methods and solid phase 
oligonucleotide synthesis. The immobilization of arrays of oligonucleotides on solid 
supports has been rendered possible by the development of a technology generally identified 

25 as "Very Large Scale Immobilized Polymer Synthesis" (VLSIPS™) in which, typically, 
probes are immobilized in a high density array on a solid surface of a chip. Examples of 
VLSIPS™ technologies are provided in US Patents 5,143,854 and 5,412,087 and in PCT 
Publications WO 90/15070, WO 92/10092 and WO 95/1 1995, which describe methods for 
forming oligonucleotide arrays through techniques such as light-directed synthesis 

30 techniques. Further presentation strategies aimed at providing arrays of nucleotides 

immobilized on solid supports were developed to order and display the oligonucleotide arrays 
on the chips in an attempt to maximize hybridization patterns and sequence information as 
disclosed in PCT Publications WO 94/12305, WO 94/1 1530, WO 97/29212 and WO 
97/31256. 
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In another embodiment, an oligonucleotide probe matrix may advantageously be used 
to detect mutations occurring in a polynucleotide disclosed hereiin. For this particular 
purpose, probes are specifically designed to have a nucleotide sequence allowing their 
hybridization to the genes that carry known mutations (either by deletion, insertion or 
5 substitution of one or several nucleotides). By "known mutations" it is meant, mutations of a 
polynucleotide including any of those disclosed herein, that have been identified using 
techniques known in the art. 

Another technique that is used to detect mutations in a polynucleotide including any 
stress-responsive sequence disclosed herein is the use of a high-density DNA array, where 

10 single base mutations are encompassed by this technique. Each oligonucleotide probe 
constituting a unit element of the high density DNA array is designed to match a specific 
subsequence of the genomic DNA or cDNA of interest. Thus, an array containing 
oligonucleotides complementary to subsequences of the target gene sequence is used to 
determine the identity of the target sequence with the "wild-type" nucleotide sequence, 

15 measure its amount, and detect differences between the target sequence and the reference 
wild-type nucleotide sequence. One such design termed a "4L tiled array", is implemented 
using a set of four probes (A, C, G, T), for example 15-nucleotide oligomers. In each set of 
four probes, the perfect complement will hybridize more strongly than mismatched probes. 
Consequently, a nucleotide target of length L is scanned for mutations with a tiled array 

20 containing 4L probes; the whole probe set containing all the possible mutations in the known 
wild reference sequence. The hybridization signals of the 15-mer probe set tiled array are 
perturbed by a single base change in the target sequence. As a consequence, there is a 
characteristic loss of signal or a "footprint" for the probes flanking a mutation position. This 
technique was described by Chee et al. (Science 274:610, 1996). 

25 Polynucleotides identified herein include those nucleotide sequences that are induced 

or repressed in response to a combination of stress conditions, but not to any of the stress 
conditions alone; and polynucleotides that are induced or repressed in response to a selected 
stress condition, but not to other stress conditions. Furthermore, polynucleotides whose 
response to a stress condition is temporally regulated are also included, such as 

30 polynucleotides that are induced early, late or continuously in a stress response. In addition, 
the polynucleotides are represented by a variety of cellular proteins, including transcription 
factors, enzymes such as kinases, channel proteins, and the like. 

The present invention additionally relates to a method of identifying a stress condition 
to which a plant cell was exposed. Such a method can be performed, for example, by 
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contacting nucleic acid molecules expressed in the plant cell with an array of probes 
representative of the plant cell genome; and detecting a profile of expressed nucleic acid 
molecules characteristic of a stress response, thereby identifying the stress condition to which 
the plant cell was exposed. The contacting generally is under conditions that allow for 
5 specific hybridization of a nucleic acid molecule with probes having sufficient 

complementarity, for example, under stringent or highly stringent hybridization conditions. 
The profile can be characteristic of exposure to a single stress condition, for example, an 
abnormal level of cold, osmotic pressure, or salinity, or can be characteristic of exposure to 
more than one stress condition, for example, cold, increased osmotic pressure and increased 
10 salinity. 

The polynucleotides for which expression is determined and so the probes used may 
be varied depending on the particular plant and/or stress involved. In one embodiment, the 
plant is a cereal and expression is determined for at least one polynucleotide selected from 
the group consisting of those sequences disclosed herein. In another embodiment, the plant is 

1 5 a rice plant. It will be apparent to those of skill in the art, that though the use of various 
technologies, for example microarrays, it is possible and in many cases desirable to 
determine the expression of multiple stress-regulated polynucleotides at once. Thus, the 
preceding various embodiments include the identification of a stress condition to which a 
plant was exposed in which expression data is obtained on at least 10, at least 25, at least 50, 

20 at least 100, at least 250, at least 500, or at least 750 of the various groups of polynucleotide 
sequences described above. 

In one embodiment, the expression profile is produced by isolating RNA, for example 
mRNA from the test plant. Methods for the isolation of RNA from plants are well known in 
the art and can be found in standard reference texts such as those cited herein. In one 

25 embodiment, the RNA is transformed into cDNA by the use of reverse transcriptase using 
protocols that are well known to those skilled in the art of molecular biology. The RNA or 
cDNA is then hybridized to probes to the stress-regulated polynucleotides described herein 
under stringent, high stringency, or very high stringency conditions and hybridization 
detected. It is envisioned that multiple probes will be used for each polynucleotide expressed 

30 and that expression of multiple polynucleotides will be determined as detailed above. 

The method can be used to determine exposure to any stress to which results in 
altered expression of the described polynucleotide sequences. In one embodiment the stress 
is a single or combination abiotic stress such as cold stress, saline stress, osmotic stress or 
any combination thereof. In one embodiment, the expression profile from the test plant is 
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also compared to a control plant of the same species, for example an isogenic plant, that has 
not been exposed to a stress. 

In one embodiment of the invention, nucleic acid samples from the plant cells to be 
collected can be contacted with an array, then the profile can be compared with known 
5 profiles prepared from nucleic acid samples of plants exposed to known stresses. By creating 
a panel of such profiles, representative of various stress conditions, an unknown stress 
condition to which a plant was exposed can be identified simply by comparing the unknown 
profile with the known profiles and determining which known profile that matches the 
unknown profile. In one embodiment, the comparison is automated. Such a method can be 

1 0 useful, for example, to identify a cause of damage to a crop, where the condition causing the 
stress is not known or gradually increases over time. For example, accumulation in soils over 
time of salts from irrigation water can result in gradually decreasing crop yields. Because the 
accumulation is gradual, the cause of the decreased yield may not be readily apparent. Using 
the present methods, it is possible to evaluate the stress to which the plants are exposed, thus 

1 5 revealing the cause of the decreased yields. 

The present invention, therefore includes a computer readable medium containing 
executable instructions form receiving expression data for sequences substantially similar to 
any of those disclosed herein and comparing expression data from a test plant to a reference 
plant th3t has been exposed to an abiotic stress. Also provided is a computer-readable 

20 medium containing sequence data for sequences substantially similar to any of the sequences 
described herein, or the complements thereof, and a module for comparing such sequences to 
other nucleic acid sequences. 

Also provided are plants and plant cells comprising plant stress-regulatory elements 
of the present invention operably linked to a nucleotide sequence encoding a detectable 

25 signal. Such plants can be used as diagnostic or "sentinel plants" to provide early warning 
that nearby plants are being stressed so that appropriate actions can be taken. In one 
embodiment, the signal is one that alters the appearance of the plant. For example, an 
osmotic stress regulatory element of the present invention can be operably linked to a 
nucleotide sequence encoding a fluorescent protein such as green fluorescent protein, a 

30 yellow fluorescent protein, a cyan fluorescent protein, or a red fluorescent protein. When 
subjected to osmotic stress, the expression of the green fluorescent protein in the sentinel 
plant provides a visible signal so that appropriate actions can be taken to remove or alleviate 
the stress. The use of fluorescent proteins in plants is known in the art and can be found, for 
example, in Leffel et al., Biotechniques 23:912, 1997. 
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The invention further relates to a method of identifying an agent that modulates the 
activity of a stress-regulated regulatory element of a plant. As used herein, the term 
"modulate the activity," when used in reference to a plant stress-regulated regulatory element, 
means that expression of nucleotide sequence from the regulatory element is increased or 
5 decreased. In particular, expression can be increased or decreased with respect to the basal 
activity of the promoter, i.e., the level of expression, if any, in the absence of a stress 
condition that normally induces expression from the regulatory element; or can be increased 
or decreased with respect to the level of expression in the presence of the inducing stress 
condition. As such, an agent can act as a mimic of a stress condition, or can act to modulate 

10 the response to a stress condition. 

Such a method can be performed, for example, by contacting the regulatory element 
with an agent suspected of having the ability to modulate the activity of the regulatory 
element, and detecting a change in the activity of the regulatory element. In one 
embodiment, the regulatory element can be operatively linked to a heterologous 

1 5 polynucleotide encoding a reporter molecule, and an agent that modulates the activity of the 
stress-regulated regulatory element can be identified by detecting a change in expression of 
the reporter molecule due to contacting the regulatory element with the agent. Such a method 
can be performed in vitro in a plant cell-free system, or in a plant cell in culture or in a plant 
in situ. 

20 A method of the invention also can be performed by contacting the agent with a 

genetically modified cell or a transgenic plant containing an introduced plant stress-regulated 
regulatory element, and an agent that modulates the activity of the regulatory element is 
identified by detecting a phenotypic change in the modified cell or transgenic plant, 

A method of the invention can be performed in the presence or absence of the stress 

25 condition to which the particularly regulatory element is responsive. As such, the method 
can identify an agent that modulates the activity of plant stress-regulated promoter in 
response to the stress, for example, an agent that can enhance the stress response or can 
reduce the stress response. In particular, a method of the invention can identify an agent that 
selectively activates the stress-regulated regulatory elements of a cluster of plant stress- 

30 regulated nucleotide sequences, but does not affect the activity of other stress-regulated 
regulatory olynucleotides. As such, the method provides a means to identify an agent that 
acts as a stress mimic. Such agents can be particularly useful to prepare a plant to an 
expected stress condition. For example, a agent that acts as a cold mimic can be applied to a 
field of plants prior to the arrival of an expected cold front. Thus, the cold stress response 
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can be induced prior to the actual cold weather, thereby providing the plants with the 
protection of the stress response, without the plants suffering from any initial damage due to 
the cold. Similarly, an osmotic pressure mimic can be applied to a crop of plants prior a field 
being flooded by a rising river. 
5 In one embodiment, the present invention provides a method for marker-assisted 

selection. Marker-assisted selection is a well-known method in the art and involves the 
selection of plant having desirable phenotypes based on the presence of particular nucleotide 
sequences known as markers. The use of makers allows plants to be selected early in 
development, often before the phenotype would normally manifest itself. Because it allows 

10 for early selection, marker-assisted selection decreases the amount of time need for selection 
and thus allows more rapid genetic progress. Briefly, marker-assisted selection involves 
obtaining nucleic acid from a plant to be selected. The nucleic acid obtained is then probed 
with probes that selectively hybridize under stringent or highly stringent, conditions to a 
nucleotide sequence or sequences associated with the desired phenotype. In one 

15 embodiment, the probes hybridize to any of the stress-responsive nucleotide sequences or 
regulatory elements disclosed herein. The presence of any hybridization products formed is 
detected and plants are then selected on the presence or absence of the hybridization 
products. 

The isolated polynucleotides of the invention can be used to create various types of 
20 genetic and physical maps of the genome of rice or other plants. Such maps are used to 
devise positional cloning strategies for isolating novel genes from the mapped crop species. 
The sequences of the present invention are also useful for chromosome mapping, 
chromosome identification, tagging genes of known and useful function, tagging genes to 
which a function has not yet been assigned, and including the uses set forth in U.S. Patent 
25 No. 5,817,479. 

In addition, because the genomes of closely related species are largely syntenic (that 
is, they display the same ordering of genes within the genome), these maps can be used to 
isolate novel alleles from wild relatives of crop species by positional cloning strategies. This 
shared synteny is very powerful for using genetic maps from one species to map genes in 
30 another. For example, a gene mapped in rice provides information for the gene location in 
maize and wheat. 

In one embodiment, the stress responsive sequences of the present invention are 
located in and can be used to identify Quantitative Trait Loci (QTLs) for a variety of uses, 
including marker-assisted breeding. Many important crop traits are quantitative traits and 
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result from the combined interactions of several genes. These genes reside at different loci in 
the genome, often on different chromosomes, and generally exhibit multiple alleles at each 
locus. Developing markers, tools, and methods to identify and isolate the QTLs involved in a 
trait, enables marker-assisted breeding to enhance desirable traits or suppress undesirable 
traits. The sequences of the invention can be used to identify QTLs and isolate alleles as 
described by Li et al. in a study of QTLs involved in resistance to a pathogen of rice. (Li et 
aL Mol Gen Genet . 261:58, 1999). 

In particular SEQ ID Nos. listed in Table 1 have been mapped to a drought resistance 
OTL located on chromosome 8 of rice (Zhang et al., Thero. Appl. Genet.. 103:19-29, 2000). 
This OTL is syntenic with drought resistance OTLs located on wheat chromosome 7S and 
barley chromosome 1. In addition to supporting the role of these sequences in drought 
resistance, these data show that support a role for these sequences in drought resistance for a 
variety of cereals. Likewise, additional SEQ ID Nos. listed in Table 1 are located in the 
drought resistance QTL on chromosome 3 of rice (Zhang et al, Thero. Appl. Genet .. 103:19- 
29, 2000). This QTL is syntenic with a drought resistance QTL located on maize 
choromosome 1 again, supporting the role of the polynucleotides in drought resistance in a 
variety of species. 

In addition to isolating QTL alleles in rice, other cereals, and other monocot and dicot 
crop species, the sequences of the invention can also be used to isolate alleles from the 
corresponding QTL(s) of wild relatives. Transgenic plants having various combinations of 
QTL alleles can then be created and the effects of the combinations measured. Once an ideal 
allele combination has been identified, crop improvement can be accomplished either through 
biotechnological means or by directed conventional breeding programs. (Flowers et al, J 
Exp Bot, 51:99, 2000; Tanksley and McCouch, Science, 277:1063, 1997). 

Polynucleotides derived from sequences of the present invention are useful to detect 
the presence in a test sample of at least one copy of a nucleotide sequence containing the 
same or substantially the same sequence, or a fragment, complement, or variant thereof. The 
sequence of the probes and/or primers of the instant invention need not be identical to those 
provided in the Sequence Listing or the complements thereof. Some variation in probe or 
primer sequence and/or length can allow additional family members to be detected, as well as 
orthologous genes and more taxonomically distant related sequences. Similarly probes 
and/or primers of the invention can include additional nucleotides that serve as a label for 
detecting duplexes, for isolation of duplexed polynucleotides, or for cloning purposes. 
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Probes and primers of the invention include isolated, purified, or recombinant 
polynucleotides containing a contiguous span of between at least 1 2 to at least 1000 
nucleotides of any of sequences disclosed herein or the complements thereof, with each 
individual number of nucleotides within this range also being part of the invention. 
5 Examples are isolated, purified, or recombinant polynucleotides containing a contiguous span 
of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 300, 400, 500, 750, 
or 1000 nucleotides of any of the sequences disclosed herein or the complements thereof. 
The appropriate length for primers and probes will vary depending on the application. For 
use as PCR primers, probes are 12-40 nucleotides, typically 1 8-30 nucleotides long. For use 

10 in mapping, probes are 50 to 500 nucleotides, typically 100-250 nucleotides long. For use in 
Southern hybridizations, probes as long as several kilobases can be used. The appropriate 
length for primers and probes under a particular set of assay conditions may be empirically 
determined by one of skill in the art. 

The primers and probes can be prepared by any suitable method, including, for 

15 example, cloning and restriction of appropriate sequences and direct chemical synthesis by a 
method such as the phosphodiester method of Narang et al. (Meth Enzvmol. 68: 90, 1979), 
the diethylphosphoramidite method, the triester method of Matteucci et al. (J Am Chem Soc. 
103: 3185, 1981), or according to Urdea et al. (Proc Natl Acad Sci. USA. 80: 7461, 1981), 
the solid support method described in EP 0 707 592, or using commercially available 

20 automated oligonucleotide synthesizers. 

Detection probes are generally nucleotide sequences or uncharged nucleotide analogs 
such as, for example peptide nucleotides which are disclosed in International Patent 
Application WO 92/20702, morpholino analogs which are described in U.S. Patent Nos. 
5,1 85,444, 5,034,506 and 5,142,047. The probe may have to be rendered "non-extendable" 

25 such that additional dNTPs cannot be added to the probe. Analogs are usually non- 
extendable, and nucleotide probes can be rendered non-extendable by modifying the 3' end 
of the probe such that the hydroxyl group is no longer capable of participating in elongation. 
For example, the 3' end of the probe can be functionalized with the capture or detection label 
to thereby consume or otherwise block the hydroxyl group. Alternatively, the 3' hydroxyl 

30 group simply can be cleaved, replaced or modified so as to render the probe non-extendable. 
Any of the polynucleotides of the present invention can be labeled, if desired, by 
incorporating a label detectable by spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. For example, useful labels include radioactive 
substances ( 32 P, 35 S, 3 H, !25 I), fluorescent dyes (5-bromodesoxyuridine, fluorescein, 
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acetylaminofluorene, digoxigenin) or biotin. In one embodiment, polynucleotides are labeled 
at their 3' and 5' ends. Examples of non-radioactive labeling of nucleotide fragments are 
described in the French patent No. FR-7810975 and by Urdea et al. (Nuc Acids Res. 16:4937, 
1988). In addition, the probes according to the present invention may have structural 
5 characteristics such that they allow the signal amplification, such structural characteristics 
being, for example, branched DNA probes as described in EP 0 225 807. 

A label can also be used to capture the primer so as to facilitate the immobilization of 
either the primer or a primer extension product, such as amplified DNA, on a solid support. 
A capture label is attached to the primers or probes and can be a specific binding member that 

1 0 forms a binding pair with the solid's phase reagent's specific binding member, for example 
biotin and streptavidin. Therefore depending upon the type of label carried by a 
polynucleotide or a probe, it may be employed to capture or to detect the target DNA. 
Further, it will be understood that the polynucleotides, primers or probes provided herein, 
may, themselves, serve as the capture label. For example, in the case where a solid phase 

1 5 reagent's binding member is a nucleotide sequence, it may be selected such that it binds a 
complementary portion of a primer or probe to thereby immobilize the primer or probe to the 
solid phase. In cases where a polynucleotide probe itself serves as the binding member, those 
skilled in the art will recognize that the probe will contain a sequence or "tail" that is not 
complementary to the target. In the case where a polynucleotide primer itself serves as the 

20 capture label, at least a portion of the primer will be free to hybridize with a nucleotide on a 
solid phase. DNA labeling techniques are well known in the art. 

Any of the polynucleotides, primers and probes of the present invention can be 
conveniently immobilized on a solid support. Solid supports are known to those skilled in 
the art and include the walls of wells of a reaction tray, test tubes, polystyrene beads, 

25 magnetic beads, nitrocellulose strips, membranes, microparticles such as latex particles, 

sheep (or other animal) red blood cells, duracytes and others. The solid support is not critical 
and can be selected by one skilled in the art. Thus, latex particles, microparticles, magnetic 
or non-magnetic beads, membranes, plastic tubes, walls of microtiter wells, glass or silicon 
chips, sheep (or other suitable animal's) red blood cells and duracytes are all suitable 

30 examples. Suitable methods for immobilizing nucleotides on solid phases include ionic, 

hydrophobic, covalent interactions and the like. A solid support, as used herein, refers to any 
material that is insoluble, or can be made insoluble by a subsequent reaction. The solid 
support can be chosen for its intrinsic ability to attract and immobilize the capture reagent. 
Alternatively, the solid phase can retain an additional receptor that has the ability to attract 
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and immobilize the capture reagent. The additional receptor can include a charged substance 
that is oppositely charged with respect to the capture reagent itself or to a charged substance 
conjugated to the capture reagent. As yet another alternative, the receptor molecule can be 
any specific binding member which is immobilized upon (attached to) the solid support and 
5 which has the ability to immobilize the capture reagent through a specific binding reaction. 
The receptor molecule enables the indirect binding of the capture reagent to a solid support 
material before the performance of the assay or during the performance of the assay. The 
solid phase thus can be a plastic, derivatized plastic, magnetic or non-magnetic metal, glass 
or silicon surface of a test tube, microtiter well, sheet, bead, microparticle, chip, sheep (or 

10 other suitable animal's) red blood cells, duracytes and other configurations known to those of 
ordinary skill in the art. The polynucleotides of the invention can be attached to or 
immobilized on a solid support individually or in groups of at least 2, 5, 8, 10, 12, 15, 20, or 
25 distinct polynucleotides of the invention to a single solid support. In addition, 
polynucleotides other than those of the invention maybe attached to the same solid support 

1 5 as one or more polynucleotides of the invention. 

Probes and primers of the invention can be used to identify and/or isolate 
polynucleotides related to the stress responsive sequences provided in the Sequence Listing, 
or allelic variants of the stress responsive sequences. Generally, related polynucleotides have 
similar sequences or encode polypeptides with similar biological activity, but are found at 

20 other loci within an organism, or are found in other organisms. Identification and isolation of 
related sequences can provide important tools for functional genomics, to study the evolution 
of genomes, and to predict gene and protein function, interaction, and regulation. Related 
sequences including paralogs and orthologs are particularly important in identifying Clusters 
of Orthologous Groups (COGs) of proteins, which aids protein function prediction and the 

25 functional and phylogenetic annotation of newly sequenced genomes. 

Hybridization of the stress-responsive sequences of the invention to nucleotides 
obtained from other organisms can be used to identify and isolate paralogous sequences, or 
paralogs, which are additional members of gene families. The terms "paralogous sequence" 
and "paralog" as used herein encompass both full-length genes and regions and fragments 

30 thereof. Paralogs may be located in the same or a different region of the genome in which the 
sequence used as a probe is located. Paralogs generally have a high sequence identity with 
the probe sequence or the gene from which the probe was prepared; however, paralogs may 
have overall sequence identity with a probe sequence as low as 20 to 30% and still be 
recognizable as members of the same gene family with similar functions, as reported by 
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Takata et al. for RAD51B paralogs flVlol Cell Biol. 20:6476, 2000). When overall sequence 
identity is not high, the sequence similarity among paralogs of a gene family is often 
concentrated into one or a few portions of the sequence, notably in a portion encoding a 
protein or RNA that has an enzymatic or structural function. The degree of identity in the 
5 amino acid sequence of the domain that defines the gene family can be as low as 20%, but is 
often at least 50%, at least 75%, at least 80 to 95%, or at least 85 to 99%. Paralogs may 
differ in their expression profiles, indicating that they may act at different time, a different 
place, or at a different developmental stage, even when their function appears to be similar. 
Differences in function among paralogs may suggest that paralogs encode polypeptides that 

10 are "remodeled" during plant evolution, for example to create new forms of oxidized 
carotenoids in tomato as described by Bouvier et al. (Eur J Biochem, 267:6346, 2000). 

In one embodiment, paralogs may be isolated by hybridizing a stress-resonsive 
sequence probe to a Southern blot containing the appropriate genomic DNA or cDNA of the 
organism. To search for paralogs within a species, low stringency hybridization is usually 

1 5 performed, but will depend on size, distribution and degree of sequence di vergence.of 

domains that define the gene family. Given the resulting hybridization data, one or ordinary 
skill in the art could distinguish and isolate the correct DNA fragments by size, restriction 
sites and stated hybridization conditions from a gel or from a library. Alternately, paralogs 
may be isolated by large-scale sequencing followed by BLAST analysis of sequences to 

20 identify putative paralogs, as described by Ospina-Giraldo et at (Fungal Genet Biol, 29:81, 
2000). In another embodiment, paralogs may be isolated using reverse-transcriptase 
polymerase chain reaction (RT-PCR) using primers to conserved regions of sequence. 
Paralogs may be cloned using standard techniques to screen libraries using at least one stress- 
responsive sequence as a probe. 

25 The stress-responsive sequences disclosed herein can also be used to determine 

orthologous sequences. An orthologous sequence, or orthologous gene, or ortholog, has a 
high degree of sequence similarity to a known sequence or gene of interest, with the 
similarity often occurring along the entire length of the coding portion of the gene. The terms 
"orthologous sequence" and "ortholog" as used herein encompass both full-length genes and 

30 functional regions and fragments thereof An ortholog often encodes a gene product that 

performs a similar function in the organism. Functions for orthologous genes are expected to 
be the same as or very similar to that of the gene from which the probe was prepared. The 
degree of identity is a function of evolutionary separation and, in closely related species, the 
degree of sequence identity can be 98 to 100%. Orthologous sequences sometimes have 
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significantly lower levels of sequence identity, for example as described by Weise et al. 
(Plant Cell 12:1345, 2000) where orthologs of sucrose transporters from Arabidopsis, 
tomato, and potato had 47% similarity to the previously characterized sucrose transporter. 
The amino acid sequence of a protein encoded by an orthologous gene can be less than 50% 
5 identical, but tends to be at least 50%, or at least 70% or at least 80% identical, or at least 
90%, or at least 95% identicaJ to the amino acid sequence of the reference protein. 

To find orthologs, probes are hybridized to nucleotides from a species of interest 
under low stringency conditions and blots are then washed under conditions of increasing 
stringency. It is preferable that the wash stringency be such that sequences that are 85 to 

10 100% identical will hybridize. More preferably, sequences 90 to 100% identical will 

hybridize and most preferably only sequences greater than 95% identical will hybridize. The 
low stringency condition is preferably one where sequences containing as much as 40-45% 
mismatches will be able to hybridize. This condition is established by T m - 40°C to T m - 
48°C. One of ordinary skill in the art will recognize that, due to degeneracy in the genetic 

1 5 code, amino acid sequences that are identical can be encoded by DNA sequences as little as 
67% identical. Thus, it is preferable to make an overlapping series of shorter probes, on the 
order of 24 to 45 nucleotides, and individually hybridize them to the same arrayed library to 
avoid the problem of degeneracy introducing large numbers of mismatches. 

In one embodiment, orthologous sequences, or orthologs, may be isolated by 

20 hybridizing an stress-responsive sequence probe to a Southern blot containing the appropriate 
genomic DNA or cDNA of a different organism, for example, another cereal. Alternately, 
orthologs may be isolated by large-scale sequencing followed by BLAST analysis of 
sequences to identify putative orthologous sequences and full-length orthologs. In another 
embodiment, orthologs may be isolated using reverse-transcriptase polymerase chain reaction 

25 (RT-PCR) using primers to conserved regions of sequence in a stress responsive sequence. 
Orthologs and/or orthologous sequences may be cloned using standard techniques to screen 
libraries using at least one stress-responsive sequence as a probe. 

As evolutionary divergence increases, genome sequences also tend to diverge. Thus, 
one of skill will recognize that searches for orthologous genes between more divergent 

30 species will require the use of lower stringency conditions compared to searches between 

closely related species. Also, degeneracy is more of a problem for searches in the genome of 
a species more distant evolutionarily from the species that is the source of the stress- 
responsive probe sequences. 
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Identification of the relationship of nucleotide or amino acid sequences among plant 
species can be done by comparison of the subject nucleotide or amino acid sequence with the 
sequences of the present application presented in the Sequence Listing. 

Sequences disclosed herein can also be used to isolate corresponding DNA by 
5 Southern blotting. Probes for Southern blotting to distinguish individual restriction 
fragments can range in size from 15 to 20 nucleotides to several thousand nucleotides. 
Typically, the probe is 100 to 1000 nucleotides long for identifying members of a gene 
family when it is found that repetitive sequences would complicate the hybridization. For 
identifying an entire corresponding gene in another species, the probe is more often the 

1 0 length of the gene, typically 2000 to 1 0,000 nucleotides, but probes 50- 1 ,000 nucleotides 
long might be used. Some genes, however, might require probes up to 15,000 nucleotides 
long or overlapping probes constituting the full-length sequence to span their lengths. 

In one embodiment, the probe derived from sequences of the present invention is 
homogeneous, having a single sequence. In another embodiment, a probe designed to 

1 5 represent or identify members of a gene family having diverse sequences can be generated 
using PCR to amplify genomic DNA or RNA templates using primers derived from stress- 
responsive sequences that include sequences that define the gene family. 

For identifying corresponding genes in another species, the probe for Southern 
blotting most preferably would be the genomic copy of the probe gene. This allows all 

20 elements of the gene to be identified in the other species. The next most preferable probe is a 
cDNA spanning the entire coding sequence which allows the entire mRNA-coding portion of 
the gene to be identified; in this case it is possible that some introns in the gene might be 
missed. Probes for Southern blotting can easily be generated from stress-responsive 
sequences by making primers having the sequence at the ends of the sequence and using rice 

25 (Oryza sativa) genomic DNA as a template. In instances where the sequence includes 

sequence conserved among species, primers including the conserved sequence can be used 
for PCR with genomic DNA from a species of interest to obtain a probe. Similarly, if the 
sequence includes a domain of interest, that portion of the sequence can be used to make 
primers and, with appropriate template DNA, used to make a probe to identify genes 

30 containing the domain. Alternatively, the PCR products can be resolved, for example by gel 
electrophoresis, and cloned and/or sequenced. In this manner, the variants of the domain 
among members of a gene family, both within and across species, can be examined. 

The sequences of the invention can be used for library screening to isolate the 
corresponding DNA from the same organism or other organisms. Either cDNA or genomic 
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DNA can be isolated. Libraries of genomic DNA, or lambda, cosmid, BAC or YAC, or other 
large insert genomic library from the plant of interest can be constructed using standard 
molecular biology techniques as described in detail by Sambrook et al., (Molecular Cloning: 
A Laboratory Manual. Cold Spring Harbor Laboratory Press, 1989) and by Ausubel et al. 
5 (Current Protocols in Molecular Biology . Greene Publishing, 1992) with updates). 

To screen a phage library, recombinant lambda clones are plated out on appropriate 
bacterial medium using an appropriate E. coli host strain. The resulting plaques are lifted 
from the plates using nylon or nitrocellulose filters. The plaque lifts are processed through 
denaturation, neutralization, and washing treatments following the standard protocols 

10 outlined by Ausubel et al. (1992 supra). The plaque lifts are hybridized to either 
radioactively labeled or non-radioactively labeled stress responsive DNA at room 
temperature for about 16 hours, usually in the presence of 50% formamide and 5X SSC 
(sodium chloride and sodium citrate) buffer and blocking reagents. The plaque lifts are then 
washed at 42°C with 1% sodium dodecyl sulfate (SDS) and at a particular concentration of 

15 SSC. The SSC concentration used is dependent upon the stringency at which hybridization 
occurred in the initial Southern blot analysis performed. For example, if a fragment 
hybridized under medium stringency such as T m - 20°C, then this condition is maintained or 
adjusted to a less stringent condition such as T m - 30°C, to wash the plaque lifts. Positive 
clones showing hybridization to the probe are detected by exposure to X-ray films or 

20 chromogen formation or any other suitable detection method, and subsequently isolated for 
purification using the same general protocol outlined above. Once the clone is purified, 
restriction analysis can be conducted to narrow the region corresponding to the gene of 
interest. Restriction analysis and succeeding subcloning steps can be done using procedures 
described by, for example, Sambrook et al. (1989, supra). 

25 To screen a YAC library, the procedures outlined for the lambda library are 

essentially similar except the YAC clones are harbored in bacterial colonies. The YAC 
clones are plated out at reasonable density on nitrocellulose or nylon filters supported by 
appropriate bacterial medium in petri plates. Following the growth of the bacterial clones, 
the filters are processed through the denaturation, neutralization, and washing steps following 

30 the procedures of Ausubel et al. (1992, supra). The same hybridization procedures for 
lambda library screening are followed. 

To isolate cDNA, similar procedures using appropriately modified vectors are 
employed. For instance, the library can be constructed in a lambda vector appropriate for 
cloning cDNA such as Xgtl 1 . Alternatively, the cDNA library can be made in a plasmid 
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vector. cDNA for cloning can be prepared by any of the methods known in the art, but is 
preferably prepared as described above. Preferably, a cDNA library will include a high 
proportion of full-length clones. 

Identification and isolation of alleles and paralogs within a species, and orthologs 
5 from other species, is particularly desirable because of their potential use as a tool for crop 
improvement especially for quantitative traits such as resistance to abiotic stress. By 
identifying and isolating numerous alleles for each locus from a single species or from 
different species, transgenic plants having various combinations of alleles can be created and 
the effects of the combinations measured. Once a more favorable ideal allele combination 

10 has been identified, crop improvement can be accomplished either through biotechnological 
means or by traditional (conventional) breeding programs. (Tanksley et al., Science 
277:1063, 1997). In a similar manner, substitution of at least one paralogous or orthologous 
sequence in at least one locus will introduce diversity at each substituted locus, and the 
substituted sequences will contribute to a trait that is influenced by combined interactions of 

1 5 the products of several genes residing at different loci in the genome. When favorable 
combinations of substituted sequences and endogenous sequences at loci whose products 
interact are identified, crop improvement can be accomplished through further 
biotechnological manipulation or by traditional breeding programs, including sexual crossing 
and also apomixis. 

20 The results from hybridization of the sequences of the invention to Southern blots 

containing DNA from another species can be used to generate restriction fragment maps for 
the corresponding genomic regions. These maps provide additional information about the 
relative positions of restriction sites within fragments, further distinguishing mapped DNA 
from the remainder of the genome. Physical maps can be made by digesting genomic DNA 

25 with different combinations of restriction enzymes. 

Sequence analysis and mapping of related sequences (paralogs and orthologs) can be 
used in phylogenetic analyses of the evolution of the sequences in question, including the 
determination of gene duplication and rearrangements. In addition, expression studies of 
related sequences can be used to further understand the evolutionary history and function of 

30 the paralogs and orthologs, and to suggest future uses for the sequences. 

Isolated polynucleotides within the scope of the invention also include allelic variants 
of the specific sequences presented in the sequence listing. An "allelic variant" is a sequence 
that is a variant from that of the stress-responsive sequence, but represents the same 
chromosomal locus in the organism. Allelic variants can arise by normal genetic variation in 
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a population. Allelic variants can also be produced by genetic engineering methods, for 
example by the method of chimeraplasty using chimeric oligonucleotides to introduce a 
single nucleotide base substitution in a target sequence, as described by Beetham et al. (Proc. 
Natl. Acad. Sci. USA . 96:8774, 1999) and Zhu et al., fProc. Natl. Acad. ScL USA , 96:8768, 
5 1999). An allelic variant can be one that is found naturally occurring in a plant, including a 
cultivar or ecotype. An allele can give rise to detectably distinct phenotypic and expression 
profiles. An allelic variant may or may not give rise to a phenotypic change, and may or may 
not be expressed. An expressed allele can result in a detectable change in the phenotype of 
the trait represented by the locus. Allelic variations can occur in any portion of the gene 

10 sequence, including regulatory regions as well as structural regions. The stress responsive 
sequences of the present invention are useful to detect and/or isolate allelic variants, and may 
be used to introduce an allelic variant at a locus. Thus, present sequences can be used to 
manipulate the allelic diversity of a plant or a population. 

With respect to nucleotide sequences, degeneracy of the genetic code provides the 

1 5 possibility to substitute at least one base of the base sequence of a gene with a different base 
without causing the amino acid sequence of the polypeptide produced from the gene to be 
changed. Hence, the sequences of the present invention may also have any base sequence 
that has been changed from a sequence as provided in the Sequence Listing by substitution in 
accordance with degeneracy of genetic code. References describing codon usage include: 

20 Carels et al., JMol Evol 46:45, 1998 and Fennoy et al., Nucl Acids Res . 21:5294, 1993. 
One embodiment of the invention comprises stress-responsive polypeptides 
containing a universal stress protein A (USPA) domain and polynucleotides encoding said 
polypeptides. The gene encoding proteins containing the USPA domain was originally found 
in E. coli (Nystrom and Neidhardt, Mol. Microbiol. 6:3187-3198, 1992). The uspA gene is 

25 unique in its almost universal responsiveness to diverse stresses. Rice stress responsive 
polypeptides containing a USPA domain are listed in Table 1. 



Examples 

The following examples are intended to provide illustrations of the application of the 
30 present invention. The following examples are not intended td completely define or 
otherwise limit the scope of the invention. 
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Example 1: Isolation and sequencing of DNA fragments 
I. I Isolation and sequencing of genomic DNA fragments 

Genomic DNA was isolated from nuclei of Oryza sativa L. ssp japonica cv 
Nipponbare and then sheared to produce fragments of approximately 500 bp. Using a 
5 method derived from the method of Mao et al (Genome Res 10:982 (2000)), seeds were 
germinated on cheese cloth immersed in water and grown for 4-6 weeks under greenhouse 
conditions. After plants reached a height of approximately 5-8 inches, the upper parts of the 
green leaves were harvested and wrapped in aluminum foil at 4°C overnight. Leaf material 
was then stored at -80°C or directly used for extraction of nuclei. Intact nuclei were isolated 

10 by homogenization (in a blender for fresh material or by grinding with mortar and pestle for 
frozen material) in a buffer containing 10 mM Trizma base, 80 mM KC1, 10 mM EOT A, 1 
mM spermidine, 1 mM spermine, 0.5 M sucrose, 0.5% Triton-X-100, 0.15% (5- 
mercaptoethanol pH 9.5. The homogenate was filtered and nuclei recovered by gentle 
centrifugation using a fixed-angle rotor at 1,800 g at 4°C for 20 minutes. The pellet 

1 5 recovered after centrifugation was gently resuspended with the assistance of a small paint 
brush soaked in ice cold wash buffer and wash buffer added. Particulate matter remaining in 
the suspension was removed by filtering the resuspended nuclei into a 50 ml centrifuge tube 
through two layers of miracloth by gravity and centrifuging the filtrate at 57 g (500 rpm), 4 C 
for 2 minutes to remove intact cells and tissue residues. The supernatant fluid was 

20 transferred into a fresh centrifuge tube and nuclei were pelleted by centrifugation at 1 ,800 g, 
4 C for 15 minutes in a swinging bucket centrifuge. 

DNA was isolated from the nuclear preparation by phenol/chloroform extraction, as 
in Sambrook et al (supra). Isolated total genomic DNA was physically sheared (Hydroshear) 
to generate for generating random DNA fragments, and fragments of approximately 500 bp 

25 were recovered. DNA was eluted and the ends filled in using T 4 DNA polymerase, Klenow 
fragments, and dNTPs. Double-stranded DNA was linkered and cloned into a proprietary 
medium-copy vector derived from pSClOl. 

Vector inserts were amplified by PCR and sequenced using the MegaB ACE 
sequencing system (Molecular Dynamics, Amersham). The amplification reaction was 

30 diluted before use and was not purified using an exonuclease/alkaline phosphatase procedure. 
Sequencing reactions were performed using DYEnamic ET Terminator Kit. The reactions 
contained approximately 50 ng of amplicon, DYEnamic ET Terminator premix, and 5 pmol 
of -40 M13 forward primer. The sequencing reaction is amplified for 30 cycles, and reaction 
products are concentrated and purified using ethanol precipitation. The sample was 
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electrokinetically injected into the capillary at 3 kV for 45 sec and separated via 

electrophoresis at 9 kV for 120 min. 

1.2 Isolation and sequencing of cDNA fragments 

Construction of rice cDNA library. Total RM A was purified from rice plant tissue 
5 using standard total RNA purification methods. PolyA+ RN A was isolated from the total 
RNA using the Qiagen Oligotex mRNA purification system (Qiagen, Valencia, CA), and 
cDNA was generated using cDNA synthesis reagents from Life Technologies (Rockville, 
MD). First strand cDNA synthesis was catalyzed by reverse transcriptase using oligo dT 
primers with a NotI restriction site. Second strand synthesis was catalyzed by DNA 

1 0 polymerase. An oligonucleotide linker with a Sail restriction endonuclease site was attached 
to the 5' end of the cDNAs using DNA ligase. The cDNAs were digested with NotI and Sail 
restriction endonucleases and inserted into an E. co/i-replicating plasmid harboring a 
selectable marker. E. coli was transfected with the recombinant plasmids and grown on 
selectable media. E. coli colonies were individually picked off the selectable media and 

1 5 placed into storage plates. 

Sequencing the rice cDNA library, The DNA sequence of the cDNA cloned into the 
plasmid purified from an E. coli colony was determined using standard dideoxy sequencing 
methods. Oligonucleotide primers respectively corresponding to plasmid DNA regions 
upstream of the 5' end of the cDNA insert (Forward reaction) and downstream of the 3' end 

20 of the cDNA insert (Reverse reaction) were used in the dideoxy sequencing reactions. If the 
DNA sequence determined as a result of the Forward and Reverse reactions from the cDNA 
overlapped, the two sequences could be merged into a contig using computerized analysis 
software (Consed, University of Washington,Seattle), to assemble a full-length sequence of 
the cDNA. In cases where DNA sequence from the Forward and Reverse reactions from a 

25 single clone did not overlap sufficiently to be assembled into a contig, such that there was a 
region of unsequenced DNA to bridge the DNA from the Forward and Reverse reaction in 
order to form a contig, the DNA sequence of the separating region was determined using one 
of two dideoxy sequencing methods. In a "primer walking" approach, a primer specifically 
corresponding to the 3' end of the DNA sequence determined from the Forward reaction was 

30 used in a second dedeoxy sequencing reaction. The primer walking procedure was repeated 
until the DNA sequence that separated the original Forward and Reverse was resolved and a 
contig could be assembled. Alternatively, the clone harboring the cDNA was subjected to 
transposon in vitro insertion dideoxysequencing (Epicentre, Madison, WI). In this procedure, 
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the insertion process was random and the result was multiple DNA sequence coverage over 
the targeted cDNA, where the sequences thus obtained were assembled into a contig. 

Example 2: GeneChip ® Standard Protocol 
5 The standard protocol for using the GeneChip® to quantitatively measure plant gene 

expression was carried out as outlined below: 
Quantitation of total RNA 

1 . Total RNA from plant tissue was extracted and quantified.Quantified total RNA 
using GeneQuant 

1 0 1 OD 2 6o=40 mg RNA/ml; A 2 6o/A 280 =l 9 to about 2.1 

2. Ran gel to check the integrity and purity of the extracted RNA 
Synthesis of double-stranded cDNA 

Gibco/BRL Superscript Choice System for cDNA Synthesis (Cat#lB090-019) was 
employed to prepare cDNAs. T7-(dT)24 oligonucleotides were prepared and purified by 
15 HPLC. (5'- GGCCAGTG AATTGT AAT ACGACTC ACTAT AGGGAGGCGG-(dT) 24 -3 ' ; 
SEQIDNO: 17507). 

Step 1. Primer hybridization: 

Incubated at 70°C for 10 minutes 
Spun quickly and put on ice briefly 
20 Step 2. Temperature adjustment: 

Incubated at 42°C for 2 minutes 
Step 3. First strand synthesis carried out using: 
DEPC-water- 1 :1 
RNA(10:gfinal)-10:l 
25 T7-(dT) 24 Primer (1 00 pmol final)- 1 :1 pmol 

5X 1 st strand cDNA bufTer-4 :1 
0.1MDTT (10 mM final)- 2:1 
10 mM dNTP mix (500 :M final)-l :1 
Superscript II RT 200 U/:l- 1 :1 
30 Total of 20:1 

Mixed well 

Incubated at 42°C for 1 hour 
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Step 4. Second strand synthesis: 

Placed reactions on ice, quick spin 
DEPC-water- 91 :1 
5X 2 nd strand cDNA buffer- 30 :l 
5 10mMdNTPmix(250mMfinal)-3 :1 

E. coli DNA ligase (10 U/:l)-l :! 
E. coli DNA polymerase 1-10 U/:l- 4 :l 
RnaseH2U/:l-l :1 
T4 DNA polymerase 5 U/:l-2 :1 
10 0.5 MEDTA (0.5 M final)— 10 :1 

Total 162:1 
Mixed/spun down/incubated 16°C for 2 hours 
Step 5. Completing the reaction: 

Incubated at 16°C for 5 minutes 
15 Purification of double stranded cDNA 

1. Centrifuged PLG (Phase Lock Gel, Eppendorf 5 Prime Inc., pi- 188233) at 14,000X, 

transfered 162 :1 of cDNA to PLG 

2. Added 162 :1 of Phenol:Chloroform:Isoarayl alcohol (pH 8.0), centrifuge 2 minutes 

3. Transfered the supernatant to a fresh 1.5 ml tube, add 

20 Glycogen (5 mg/ml) 2 

0.5 M NH 4 OAC (0.75xVol) 120 
ETOH (2.5xVol } -20°C) 400 

4. Mixed well and centrifuge at 14,000X for 20 minutes 

5. Removed supernatant, added 0.5 ml 80% EtOH (-20°C) 

25 6. Centrifuged for 5 minutes, air dry or by speed vac for 5- 1 0 minutes 

7. Added 44 :1 DEPC H 2 0 
Analyzed quantity and size distribution of cDNA 

Ran a gel using 1 :1 ratio of the double-stranded synthesis product to loading buffer 
Synthesis of biotinvlated cRNA 
30 (used Enzo BioArray High Yield RNA Transcript Labeling Kit 

Cat#900182) 

Purified cDNA 22 :1 

1 OX Hy buffer 4:1 
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10X biotin ribonucleotides 
10X DTT 

10X Rnase inhibitor mix 
20X T7 RNA polymerase 



4:1 
4:1 
4:1 
2:1 



Total 



40 :1 



Centrifuged 5 seconds, and incubated for 4 hours at 37°C 
Gently mixed every 30-45 minutes 
Purification and quantification of cRNA 

(used Qiagen Rneasy Mini kit Cat# 74103) 



cRNA • 
DEPC H 2 0 
RLT buffer 
EtOH 



Total 



40:1 
60:1 
350:1 
250 :1 
700:1 



mix by vortexing 
mix by pipetting 



Waited 1 minute or more for the RNA to stick 
Centrifuged at 2000 rpm for 5 minutes 

RPE buffer 500:1 
Centrifuged at 10,000 rpm for 1 minute 

RPE buffer 500:1 
Centrifuged at 10,000 rpm for 1 minute 
Centrifuged at 10,000 rpm for 1 minute to dry the column 

DEPCH 2 0 30:1 
Waited for 1 minute, then elute cRNA from by centrifugation, 10K 1 minute 

DEPC H 2 0 30 :1 

Repeated previous step 

Determined concentration and dilute to 1 :g/:l concentration 
Fragmentation of cRNA 

cRNA(l :g/:l) 15:1 

5X Fragmentation Buffer* 6 :1 

DEPCH 2 0 ill . 

30 :1 
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*5x Fragmentation Buffer 
lMTris(pH8.1) < 
MgOAc I 



0.64 g 
0.98 g 



4.0 ml 



KOAC 



DEPC H 2 0 



Total 



20 ml 



Filter Sterilize 



Array washed and stained in: 

Stringent Wash Buffer** 

Non-Stringent Wash Buffer*** 

SAPE Stain**** 

Antibody Stain***** 
Washed on fluidics station using the appropriate antibody amplification protocol 

**Stringent Buffer: 12X MES 83.3 ml, 5 M NaCl 5.2 ml, 10% Tween 1.0 ml, 
H 2 O910ml, 

Filter Sterilize 

***Non-Stringent Buffer: 20X SSPE 300 ml, 10% Tween 1 .0 ml, H 2 0 698 ml, 
Filter Sterilize, Antifoam 1.0. 
****SAPE stain: 2X Stain Buffer 600 :1, BSA 48 :1, SAPE 12:1, H 2 0 540 :1. 
*****Antibody Stain: 2X Stain Buffer 300 :1 3 H 2 0 266.4 :1, BSA 24 :l, Goat IgG 6 
:1, Biotinylated Ab 3.6 :1 

Example 3: Profiling of plant stress-regulated genes 

A GeneChip® Rice Genome Array (Affymetrix, Santa Clara, CA) was used to 
identify clusters of genes that were coordinately induced in response to various stress 
conditions. The GeneChip® Rice Genome Array contains probes synthesized in situ and is 
designed to measure temporal and spatial gene expression of approximately 18,000 genes 
which covers approximately 40-50% of the genome. . 

The Affymetrix GeneChip® array was used to define nucleotide sequences/ pathways 
affected by various abiotic stresses and to define which are uniquely regulated by one stress 
and those that respond to multiple stress, and to identify candidate nucleotide sequences for 
screening for insertional mutants. Of the approximately 18,000 nucleotide sequences 
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represented on the Affymetrix GeneChip® anay, certain nucleotide sequences showed at least 
a 2-fold change in expression in at least one sample, relative to no-treatment controls. 

The following describes in more detail how the experiments were done. Transcriptional 
profiling was performed by hybridizing fluorescence labeled cRNA with the oligonucleotides 
5 probes on the chip, washing, and scanning. Each gene is represented on the chip by about 
sixteen oligonucleotides (25-mers). Expression level is related to fluorescence intensity. 
Starting material contained 1 to 10 total RNA; detection specificity was about 1:10 6 ; 
approximately a 2-fold change was detectable, with less than 2% false positive; the dynamic 
range was approximately 500x. Nucleotide sequences having up to 70% to 80% identity could 
1 0 be discriminated using this system. 

3.1 Growth conditions 

Rice plants were grown for 6 weeks in convirons in plastic pots filled with sand. The 
conditions of the conviron are 12 h/12h light/dark, 25°C, -50% RH and light intensity at 300 
IlEL The plants were fertilized three times per week with one-half-strength Hoagland 
15 Solution containing 25 jiM KH 2 P0 4 . 

3.2 Abiotic stress treatment 

Six weeks after placing the rice plants in convirons, stresses were applied as follows: 
•Control - no treatment; 

-Drought - 25% PoIyEthyleneGlycol (PEG) 8000 (PEG is a more 
20 controllable method for creating a water-deficit, the osmotic pressure from PEG will 

mimic the water-deficit experienced during drought) 

Osmotic Stress = 260.0 mM Mannitol (equivalent osmolarity of a 150.0 mM 
NaCl solution) 

NaCl= 150.0 mM 

25 Cold = 14°C (the temperature at which pollen mother cell development is 

affected) 

The abiotic stress treatments was applied at time 0 and then at the same time of day 
on subsequent days (ie. Time 0, 24, 48 and 72 hours). 

3. 3 Tissue sampling 

30 After the onset of treatment, 3 time points were harvested, namely, 3hr, 27 hr and 75 

hr. Leaves and roots were harvested separately and the tissue flash-frozen in liquid nitrogen. 
These time points are set to be the exact same time of day at all 3 time points to eliminate the 
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effects of circadian rhythms in gene expression. RNA was purified, and the samples were 
analyzed using the GeneChip® Rice Genome Array (Affymetrix, Santa Clara, C A) following the 
manufacturer's protocol. 

5 3.4. Data analysis 

Raw fluorescence values as generated by Affymetrix software were processed as 
follows: the values were brought into Microsoft Excel® and values of 25 or less were set to 25 
(an empirically determined baseline as disclosed in Zhu and Wang, Plant Physiol. 124:1472- 
1476; 2000). The values from the stressed samples were then converted to fold change relative 
10 to control by dividing the values from the stressed samples by the values from the no-treatment 
control samples. Expression patterns that were altered at least 2-fold with respect to the control 
were selected. This method gave very robust results and resulted in a larger number of 
nucleotide sequences called as stress-regulated than previous methods had permitted. 

Based on the profiles obtained following hybridization of nucleic acid molecules 
1 5 obtained from plant cells exposed to various stress conditions to the probes in the microarray, 
clusters of nucleotide sequences that were altered at least two-fold in response to the stress 
conditions were identified. 

Example 4 Identification of Abiotic Shess Responsive Genes by Yeast Two Hybrid System 
20 An automated, high-throughput yeast two hybrid assay technology provided by Myriad 

Genetics Inc., (Salt Lake City, UT) was used to search for protein interactions with a bait protein 
known to be inducible by chilling in rice. 

Multiple prtein fragments that encode recognizable motifs, or domains, were constructed 
as baits from the ORF encoding the protein to be studied. A screening protocol, which uses 
25 Myriad's proprietary strains and vectors, was then used to search the individual baits against two 
activation domain libraries of greater than five million cDNA clones of assorted peptide motifs. 
The libraries were derived from RNA isolated from leaves, stems and roots of rice plants grown 
in normal conditions plus tissues form plants exposed to various stresses (input trait library) and 
from various seed stages, callus, and early and late panicle (output trait library). Both hybrid 
30 proteins were expressed in a yeast reporter strain where an interaction between the test proteins 
results in transcription of the reporter genes TRP1 and LEU2, allowing growth on selective 
medium lacking tryptophan and leucine. Positives obtained from these searches were isolated 
and their identity was determined by sequence analysis against proprietary and public nucleic 
acid and protein databases. 
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To further characterize the polynucleotides encoding interacting proteins, the sequences 
of the baits and preys were compared with the gene fragments represented on a proprietary 
GeneChip® Rice Genome Array (Affymetrix, Santa Clara, CA) and where a polynucleotide was 
identitified on the chip, its expression was experimentally determined. Experiments included 
5 evaluating differential gene expression from various plant tissues comprising seed, root, leaf and 
stem, panicle, and pollen. 

Using these methods, an ORF (SEQ ID NO: 17505) was identified encoding a protein 
that interacted with the chilling inducible bait. This ORF is contained in a full-length cDNA 
(SEQ ID NO: 17506) which also includes 258 bp of 5' UTR and 423 bp of 3 ' UTR. 

10 

Example 5: Rice Ortholoes of Arabidopsis Abiotic stress Genes Identified by Reverse 
Genetics 

Understanding the function of every gene is the major challenge in the age of 
completely sequenced eukaryotic genomes. Sequence homology can be helpful in 
15 identifying possible functions of many genes. However, reverse genetics, the process of 
identifying the function of a gene by obtaining and studying the phenotype of an individual 
containing a mutation in that gene, is another approach to identify the function of a gene. 

Reverse genetics in Arabidopsis has been aided by the establishment of large publicly 
available collections of insertion mutants (Krysan et al., Plant Cell 1 1 :2283-2290, 1999; 

20 Tisseret al.. Plant Cell 1 1:1841-1852, 1999; Speulman et al, PlantCeU 11:1853-1866, 1999; 
Parinov et al., Plant Cell 1 1 :2263-2270, 1999; Parinov and Sundaresan, Biotechnology . 
1 1:157-161, 2000). Mutations in genes of interest are identified by screening the population 
by PCR amplification using primers derived from sequences near the insert border and the 
gene of interest to screen through large pools of individuals. Pools producing PCR products 

25 are confirmed by Southern hybridization and further deconvolved into subpools until the 
individual is identified (Sussman et al, Plant Physiology. 124:1465-1467, 2000). 

Recently, some groups have begun the process of sequencing insertion site flanking 
regions from individual plants in large insertion mutant populations, in effect prescreening a 
subset of lines for genomic insertion sites (Parinov et al, Plant Cell 1 1 :2263-2270, 1999; 

30 Tisser et al, Plant Cell 1 1:1841-1852, 1999). The advantage to this approach is that the 

laborious and time-consuming process of PCR-based screening and deconvolution of pools is 
avoided. 

A large database of insertion site flanking sequences from approximately 100,000 T- 
DNA mutagenized Arabidopsis plants of the Columbia ecotype (GARLIC lines) is prepared. 
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T-DNA left border sequences from individual plants are amplified using a modified thermal 
asymmetric interlaced-polymerase chain reaction (TA1L-PCR) protocol (Liu et al., Plant J. . 
8:457-463, 1995). Left border TAIL-PCR products are sequenced and assembled into a 
database that associates sequence tags with each of the approximately 100,000 plants in the 
5 mutant collection. Screening the collection for insertions in genes of interest involves a 

simple gene name or sequence BLAST query of the insertion site flanking sequence database, 
and search results point to individual lines. Insertions are confirmed using PCR. 

Analysis of the GARLIC insert lines suggests that there are 76,856 insertions that 
localize to a subset of the genome representing coding regions and promoters of 22,880 

10 genes. Of these, 49,231 insertions lie in the promoters of over 18,572 genes, and an 
additional 27,625 insertions are located within the coding regions of 13,612 genes. 
Approximately 25,000 T-DNA left border mTAIL-PCR products (25% of the total 102,765) 
do not have significant matches to the subset of the genome representing promoters and 
coding regions, and are therefore presumed to lie in noncoding and/or repetitive regions of 

15 the genome. 

The Arabidopsis T-DNA GARLIC insertion collection is used to investigate the roles 
of certain genes in abiotic stress. Target genes are chosen using a variety of criteria, 
including public reports of mutant phenotypes, RNA profiling experiments, and sequence 
similarity to genes implicated in abiotic stress. Plant lines with insertions in genes of interest 
20 are then identified. Each T-DNA insertion line is represented by a seed lot collected from a 
plant that is hemizygous for a particular T-DNA insertion. Plants homozygous for insertions 
of interest are identified using a PCR assay. The seed produced by these plants is 
homozygous for the T-DNA insertion mutation of interest. 

Homozygous mutant plants are tested for altered stress response. The genes 
25 interrupted in these mutants contribute to the observed phenotype. The genes interrupted in 
these mutants interfere with the normal response of the plant to abiotic stresses. 

Rice orthologs of the Arabidopsis genes affecting the plants response to an abiotic 
stress are identified by similarity searching of a rice database using the Double-Affine Smith- 
Waterman algorithm (BLASP with e values better than " 10 ). 

30 

Example 6 : Cloning and Sequencing of Nucleic Acid Molecules from Rice 

6.1 Genomic DNA: Plant genomic DNA samples are isolated from a collection of 
tissues. Individual tissues are collected from a minimum of five plants and pooled. DNA can 
be isolated according to one of the three procedures, e.g., standard procedures described by 
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Ausubel et al. (1995), a quick leaf prep described by Klimyuk et ah (Plant J., 3:493-494, 
1993), or using FTA paper (Life Technologies. Rockville, MD). 

For the latter procedure, a piece of plant tissue such as, for example, leaf tissue is 
excised from the plant, placed on top of the FTA paper and covered with a small piece of 
5 parafilm that serves as a barrier material to prevent contamination of the crushing device. In 
order to drive the sap and cells from the plant tissue into the FTA paper matrix for effective 
cell lysis and nucleic acid entrapment, a crushing device is used to mash the tissue into the 
FTA paper. The FTA paper is air dried for an hour. For analysis of DNA, the samples can 
be archived on the paper until analysis. Two mm punches are removed from the specimen 

10 area on the FTA paper using a 2 mm Harris Micro Punch™ and placed into PCR tubes. Two 
hundred (200) microliters of FTA purification reagent is added to the tube containing the 
punch and vortexed at low speed for 2 seconds. The tube is then incubated at room 
temperature for 5 minutes. The solution is removed with a pipette so as to repeat the wash 
one more time. Two hundred (200) microliters of TE (10 mM Tris, 0.1 mM EDTA, pH 8.0) 

1 5 is added and the wash is repeated two more times. The PCR mix is added directly to the 
punch for subsequent PCR reactions. 

6.2 Cloning of Candidate cDNA: A candidate cDNA is amplified from total RNA 
isolated from rice tissue after reverse transcription using primers designed against the 
computationally predicted cDNA. Primers designed based on the genomic sequence can be 

20 used to PCR amplify the full-length cDNA (start to stop codon) from first strand cDNA 
prepared from rice cultivar Nipponbare tissue. 

The Qiagen RNeasy kit (Qiagen, Hilden, Germany) is used for extraction of total RNA. 
The Superscript II kit (Invitrogen, Carlsbad, USA) is used for the reverse transcription 
reaction. PCR amplification of the candidate cDNA is carried out using the reverse primer 

25 sequence located at the translation start of the candidate gene in 5' - 3' direction. This is 
performed with high-fidelity Taq polymerase (Invitrogen, Carlsbad, USA). 

The PCR fragment is then cloned into pCR2.1-TOPO (Invitrogen) or the pGEM-T easy 
vector (Promega Corporation, Madison, Wis.) per the manufacturer's instructions, and 
several individual clones are subjected to sequencing analysis. 

30 6.3 DNA sequencing : DNA preps for 2-4 independent clones are miniprepped 

following the manufacturer's instructions (Qiagen). DNA is subjected to sequencing analysis 
using the BigDye™ Terminator Kit according to manufacturer's instructions (Applied 
Biosystems Inc., Foster City, CA) . Sequencing makes use of primers designed to both 
strands of the predicted gene of interest. DNA sequencing is performed using standard dye- 
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terminator sequencing procedures and automated sequencers (models 373 and 377; Applied 
Biosystems). All sequencing data are analyzed and assembled using the Phred/Phrap/Consed 
software package (University of Washington) to an error ratio equal to or less than 10" 4 at the 
consensus sequence level. 
5 The consensus sequence from the sequencing analysis is then to be validated as being 

intact and the correct gene in several ways. The coding region is checked for being full 
length (predicted start and stop codons present) and uninterrupted (no internal stop codons). 
Alignment with the gene prediction and BLAST analysis is used to ascertain that this is in 
fact the right gene. 

1 0 The clones are sequenced to verify their correct amplification. 

Example 7 : Functional analysis in plants 

A plant complementation assay can be used for the functional characterization of the 
abiotic stress genes according to the invention. 

15 Rice and Arabidopsis putative orthologue pairs are identified using BLAST 

comparisons, TFASTXY comparisons, and Double-Affine Smith- Waterman similarity 
searches. Constructs containing a rice cDNA or genomic clone inserted between the promoter 
and terminator of the Arabidopsis orthologue are generated using overlap PCR (Horton et al., 
Gene. 77: 61-68, 1989) and GATEWAY cloning (Life Technologies Invitrogen. Carlsbad, 

20 CA). For ease of cloning, rice cDNA clones are preferred to rice genomic clones. A three 
stage PCR strategy is used to make these constructs. 

(1) In the first stage, primers are used to PCR amplify: (i) 2Kb upstream of the 
translation start site of the Arabidopsis orthologue, (ii) the coding region or cDNA of the rice 
orthologue, and (iii) the 500 bp immediately downstream of the Arabidopsis orthogue's 

25 translation stop site. Primers are designed to incorporate onto their 5' ends at least 16 bases 
of the 3* end of the adjacent fragment, except in the case of the most distal primers which 
flank the gene construct (the forward primer of the promoter and the reverse primer of the 
terminator). The forward primer of the promoters contains on their 5' ends partial AttBl 
sites, and the reverse primer of the terminators contains on their 5* ends partial AttB2 sites, 

30 for Gateway cloning. 

(2) In the second stage, overlap PCR is used to join either the promoter and the 
coding region, or the coding region and the terminator. 

(3) In the third stage, either the promoter-coding region product can be joined to 
the terminator or the coding region-terminator product can be joined to the promoter, using 
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overlap PCR and amplification with full! Att site-containing primers, to link all three 
fragments, and put full Att sites at the construct termini. 

The fused three-fragment piece flanked by Gateway cloning sites are introduced into 
the LTI donor vector pDONR20l using the BP clonase reaction, for confirmation by 
5 sequencing. Confirmed sequenced constructs are introduced into a binary vector containing 
Gateway cloning sites, using the LR clonase reaction such as, for example, pAS200. 

The pAS200 vector was created by inserting the Gateway cloning cassette RfA into the 
Acc65I site of pNOV3510. 

pNOV3510 was created by ligation of inverted pNOV21 14 VSI binary into pCTK7- 
10 PTX5'AtPPONOS. 

pNOV21 14 was created by insertion of virGN54D (Pazour et al., J. Bacteriol. 
174:4169-4174, 1992) from pAD1289 (Hansen et al., Proc. Natl. Acad. Sci. USA 91 :7603- 
7607, 1994) into P HiNK085. 

pHiNK085 was created by deleting the 35S:PMI cassette and M13 ori in pVictorHiNK. 
15 pPVictorHiNK was created by modifying the T-DNA of pVictor (described in WO 

97/041 12) to delete Ml 3 derived sequences and to improve its cloning versatility by 
introducing the BIGLINK polylinker. 

The sequence of the pVictor HiNK vector is disclosed in SEQ ID NO: 5 of WO 
00/6837, which is incorporated herein by reference. The pVictorHiNK vector contains the 
20 following constituents that are of functional importance: 

- The origin of replication (ORI) functional in Agrobacterium is derived from the 
Pseudomonas aeruginosa plasmid pVSl (Itoh et al., Plasmid. 1 1 : 206-220 1984; Itoh 
and Haas, Gene. 36: 27-36, 1985). The pVSl ORI is only functional in Agrobacterium 
and can be mobilized by the helper plasmid pRK2013 from E.coli into A. iumefaciens 

25 by means of a triparental mating procedure (Ditta et al, Proc. Natl. Acad. Sci USA. 

77:7347-7351, 1980). 

- The ColEl origin of replication functional in E. coli is derived from pUC19 
(Yannisch-Perron et al., Gene, 33:103-1 19, 1985). 

- The bacterial resistance to spectinomycin and streptomycin encoded by a 0.93 kb 

30 fragment from transposon Tn7 (Fling et al., Nucl. Acids Res.. 13:7095, 1985) functions 

as selectable marker for maintenance of the vector in E. coli and Agrobacterium. The 
gene is fused to the tac promoter for efficient bacterial expression (Amman et al., Gene. 
25:167-178, 1983). 

- The right and left T-DNA border fragments of 1 .9 kb and 0.9 kb that comprise 
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the 24 bp border repeats, have been derived from the Ti-plasmid of the nopal ine type 
Agrobacterium tumefaciens strains pTiT37 (Yadav et al., Proc. Natl. Acad. Sci. USA.. 
79:6322-6326, 1982). 

The plasraid is introduced into Agrobacterium tumefaciens GV310lpMP90 by 
5 electroporation. The positive bacterial transformants are selected on LB medium 

containing 50 jig//il kanamycin and 25 jxg/jil gentamycin. Plants are transformed by 
standard methodology (e.g., by dipping flowers into a solution containing the 
Agrobacterium) except that 0.02% Silwet -77 (Lehie Seeds, Round Rock, TX) is added 
to the bacterial suspension and the vacuum step omitted. Five hundred (500) mg of seeds 
10 are planted per 2 ft 2 flat of soil and plant transformants are selected by spraying with the 
herbicide formulated BASTA (2 ml of Finale, AgrEvo Environmental Health, Montvale, 
NJ, is added to 498 ml water) once every two days, for a week. 



Example 8: Vector construction for overexpression and sene "knockout " 
15 experiments 

8.1 Overexpression 

Vectors used for expression of full-length "abiotic stress candidate genes" of interest in 
plants (overexpression) are designed to overexpress the protein of interest and are of two 
general types, biolistic and binary, depending on the plant transformation method to be used. 

20 

For biolistic transformation (biolistic vectors), the requirements are as follows: 

1 . a backbone with a bacterial selectable marker (typically, an antibiotic resistance gene) 
and origin of replication functional in Escherichia coli (E. coli; eg. ColEl), and 

2. a plant-specific portion consisting of: 

25 a. a gene expression cassette consisting of a promoter (eg. ZmUBIint MOD), the 

gene of interest (typically, a full-length cDNA) and a transcriptional 
terminator (eg. Agrobacterium tumefaciens nos terminator); 
b. a plant selectable marker cassette, consisting of a promoter (e.g. rice ActlD- 
BV MOD), selectable marker gene (e.g. phosphomannose isomerase, PMI) 

30 and transcriptional terminator (e.g. CaMV terminator). 

Vectors designed for transformation by Agrobacterium tumefaciens {A. tumefaciens; binary 
vectors) consist of: 
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1 . a backbone with a bacterial selectable marker functional in both £. coii and A. 
tumefaciens (e.g. spectinomycin resistance mediated by the aadA gene) and two 
origins of replication, functional in each of aforementioned bacterial hosts, plus the A. 
tumefaciens virG gene; 

5 2. a plant-specific portion as described for biolistic vectors above, except in this instance 

this portion is flanked by A. tumefaciens right and left border sequences which 
mediate transfer of the DNA flanked by these two sequences to the plant. 

8,2 Knock out vectors 

Vectors designed for reducing or abolishing expression of a single gene or of a family 
10 or related genes (knockout vectors) are also of two general types corresponding to the 
methodology used to downregulate gene expression: antisense or double-stranded RNA 
interference (dsRNAi). 

(a) Anti-sense 

For antisense vectors, a full-length or partial gene fragjnent (typically, a portion of the 
15 cDNA) can be used in the same vectors described for full-length expression, as part of the 
gene expression cassette. For antisense-mediated down-regulation of gene expression, the 
coding region of the gene or gene fragment will be in the opposite orientation relative to the 
promoter; thus, mRNA will be made from the non-coding (antisense) strand in planta. 

(b) dsRNAi 

20 For dsRNAi vectors, a partial gene fragment (typically, 300 to 500 basepairs long) is 

used in the gene expression cassette, and is expressed in both the sense and antisense 
orientations, separated by a spacer region (typically, a plant intron, e.g. the OsSHl intron 1, 
or a selectable marker, e.g. conferring kanamycin resistance). Vectors of this type are 
designed to form a double-stranded mRNA stem, resulting from the basepairing of the two 

25 complementary gene fragments in planta. 

Biolistic or binary vectors designed for overexpression or knockout can vary in a 
number of different ways, including e.g. the selectable markers used in plant and bacteria, the 
transcriptional terminators used in the gene expression and plant selectable marker cassettes, 
30 and the methodologies used for cloning in gene or gene fragments of interest (typically, 
conventional restriction enzyme-mediated or Gateway™ recombinase-based cloning). An 
important variant is the nature of the gene expression cassette promoter driving expression of 
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the gene or gene fragment of interest in most tissues of the plants (constitutive, eg. ZmUBIint 
MOD), in specific plant tissues (eg. maize ADP-gpp for endosperm-specific expression), or 
in an inducible fashion (eg. GAL4bsBzl for estradiol-inducible expression in lines 
constitutively expressing the cognate transcriptional activator for this promoter). 

5 

Example 9: Insertion of an "abiotic stress candidate pene" into an expression 

vector 

A validated rice cDNA clone in pCR2.1-TOPO or the pGEM-T easy vector is 
subcloned using conventional restriction enzyme-based cloning into a vector, downstream of 

10 the maize ubiquitin promoter and intron, and upstream of the Agrobacterium tumefaciens nos 
3' end transcriptional terminator. The resultant gene expression cassette (promoter, "abiotic 
stress candidate gene" and terminator) is further subcloned, using conventional restriction 
enzyme-based cloning, into the pNOV21 17 binary vector (Negrotto et al, Plant Cell Reports 
19, 798-803, 2000; plasmid pNOV117 discosed in this article corresponds to pNOV21 17 

1 5 described herein), generating pNOVCAND. 

The pNOVCAND binary vector is designed for transformation and over-expression of 
the "abiotic stress candidate gene" in monocots. It consists of a binary backbone containing 
the sequences necessary for selection and growth in Escherichia coli DH-5a (Invitrogen) and 
Agrobacterium tumefaciens LBA4404 (pAL4404; pSBl), including the bacterial 

20 spectinomycin antibiotic resistance aadA gene from £. coli transposon Tn7, origins of 

replication for E. coli (ColEl) and A. tumefaciens (VSl), and the A. tumefaciens virG gene. 
In addition to the binary backbone, which is identical to that of pNOV21 14 described herein 
previously (see Example 7 above), pNOV21 17 contains the T-DNA portion flanked by the 
right and left border sequences, and including the Positech™ (Syngenta) plant selectable 

25 marker (WO 94/20627) and the "abiotic stress candidate gene" gene expression cassette. 
The Positech™ plant selectable marker confers resistance to mannose and in this instance 
consists of the maize ubiquitin promoter driving expression of the PMI (phosphomannose 
isomerase) gene, followed by the cauliflower mosaic virus transcriptional terminator. 
Plasmid pNOV21 17 is introduced into Agrobacterium tumefaciens LBA4404 

30 (pAL4404; pSBl) by electroporation. Plasmid pAL4404 is a disarmed helper plasmid (Ooms 
et al., Plasmid. 7:15-29, 1982). Plasmid pSBl is a plasmid with a wide host range that 
contains a region of homology to pNOV21 17 and a 15.2 kb Kpnl fragment from the 
virulence region of pTiBo542 (Ishida et al., Nat. Biotechnol., 14:745-750, 1996). 
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Introduction of plasmid pNOV21 17 into Agrobacterium strain LBA4404 results in a co- 
integration of pNOV21 1 7 and pSBl . 

Alternatively, plasmid pCIB7613, which contains the hygromycin phosphotransferase 
(hpt) gene (Gritz and Davies, Gene. 25:179-188, 1983) as a selectable marker, may be 
5 employed for transformation. 

Plasmid pCIB7613 (see WO 98/06860, incorporated herein by reference) is selected for 
rice transformation. In pCIB7613, the transcription of the nucleic acid sequence coding 
hygromycin-phosphotransferase (HYG gene) is driven by the corn ubiquitin promoter 
(ZmUbi) and enhanced by com ubiquitin intron I. The 3'polyadenylation signal is provided 
10 by NOS 3' nontranslated region. 

Other useful plasmids include pNADII002 (GAL4-ER-VP16) which contains the yeast 
GAL4 DNA Binding domain (Keegan et ah, Science. 231:699, 1986), the mammalian 
estrogen receptor ligand binding domain (Greene et aL, Science. 231:1150, 1986) and the 
transcriptional activation domain of the HSV VP 16 protein (Triezenberg et al., Genes Dev. . 
15 2:718-729, 1988). Both hpt and GAL4-ER-VP16 are constitutively expressed using the 
maize Ubiquitin promoter, and pSGCDLl (GAL4BS Bzl Luciferase), which carries the 
firefly luciferase reporter gene under control of a minimal maize Bronzel (Bzl) promoter 
with 10 upstream synthetic GAL4 binding sites. All constructs use termination signals from 
the nopaline synthase gene. 

20 

Example 10: Rice Transformation 

pNOVCAND is transformed into a rice cultivar (Kaybonnet) using Agrobacterium- 
mediated transformation, and mannose-resistant calli are selected and regenerated. 

Agrobacterium is grown on YPC solid plates for 2-3 days prior to experiment initiation. 
25 Agrobacterial colonies are suspended in liquid MS media to an OD of 0.2 at X.600nm. 

Acetosyringone is added to the agrobacterial suspension to a concentration of 200|oM and 
agro is induced for 30min. 

Three-week-old calli which are induced from the scutellum of mature seeds in the N6 
medium (Chu et al., Sci. Sin.. 18:659-668, 1975) are incubated in the agrobacterium solution 
30 in a 1 00 x 25 petri plate for 30 minutes with occasional shaking. The solution is then 

.removed with a pipet and the callus transfered to a MSAs medium which is overlayed with 
sterile filter paper. 

Co-Cultivation is continued for 2 days in the dark at 22°C. 
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Calli are then placed on MS-Timetin plates for 1 week. Afler that they are tranfered to 
PAA + mannose selection media for 3 weeks. 

Growing calli (putative events) are picked and transfered to PAA+ mannose media and 
cultivated for 2 weeks in light. 
5 Colonies are tranfered to MS20SorbKinTim regeneration media in plates for 2 weeks in 

light. Small plantlets are transferred to MS20SorbKinTim regeneration media in GA7 
containers. When they reach the lid, they are transfered to soil in the greenhouse. 

Expression of the "abiotic stress candidate gene" in transgenic To plants is analyzed. 
10 Additional rice cultivars, such as but not limited to, Nipponbare, Taipei 309 and Fuzisaka 2 
are also transformed and assayed for expression of the "abiotic stress candidate gene" 
product and enhanced protein expression. 

Example 11: Analysis of mutant and transgenic plant material 

15 11.1 Testing Arabidopsis seedlings using agar plates 

Arabidopsis seedlings can be assayed for abiotic stress phenotypes by measuring root 
growth rate under control and experimental conditions (Wu et al., Plant Cell 8:617-627, 
1996). Four to five day-old axenic seedlings are produced by germinating surface-sterilized 
seeds on agar growth medium plates oriented vertically. The seedlings are transferred to agar 

20 growth medium containing inhibitory levels of NaCl or Polyethylene glycol or mannitol or 
kept on the original plate for temperature stress (chilling stress = 4°C, freezing stress < 0°C, 
heat stress > 37°C) and control seedlings are transferred to a new plate containing normal 
growth medium. Upon transfer (or exposure to the temperature extreme), the plates are 
rotated 180 degrees so that subsequent root growth occurs in the exact opposite direction to 

25 previous growth due to the agravitropic root response. Growth subsequent to exposure to the 
abiotic stress is measured as the growth that occurs after bending of the root. Sensitivity or 
tolerance to abiotic stress is expressed as percent of experimental growth versus control 
growth following transfer. 

30 11.2 Testing Arabidopsis or rice plants growing in soil 

Adult Arabidopsis plants can be assayed for abiotic stress phenotypes by exposing 
soil-grown plants to abiotic stresses such as NaCl-, osmotic-, drought-, or temperature-stress 
and scoring for survivability following exposure (Wu et al., Plant Cell. 8:617-627, 1996; 
Kasuga et al., Nat. Biotech.. 17:287-291, 1999). Three-week-old plants are exposed to 
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abiotic stress conditions by sub-irrigating with NaCI, mannitol or polyethylene glycol, or the 
pots are moved from normal growing temperature to <0°C or > 37°C, or water is withheld. 
Resistant and sensitive phenotypes can be distinguished within 2 to 5 days of treatment 
depending on the stress (Kasuga et al., Nat. Biotech. . 17:287-291, 1999). Similar methods 
5 have been applied to cereal plants such as rice (Babu et al., Crop Sci. . 39:150-158, 1999; 
Saijo et al., Plant J., 23:319-327, 2000). It is also possible to assay abiotic stress phenotypes 
using plants growing hydroponically (Moons et al., Plant PhvsioL 107:177-186, 1995; 
Kawaski et al, Plant Cell. . 13:889-905, 2001). Young plants are grown in liquid nutrient 
medium and the stress treatments are applied by mixing in the stressful compounds such as 
10 NaCI, mannitol or Polyethylene glycol and assessing growth visually. 

Example 12: Chromosomal Markers to Identify the Location of a Nucleic Acid Sequence 

The sequences of the present invention can also be used for SSR mapping. SSR 
mapping in rice has been described by Miyao et al. (DNARes., 3:233, 1996) and Yang et al. 

15 (MoL Gen. Genet. , 245:187, 1994), and in maize by Ahn et al. (Mol. Gen. Genet., 241 :483, 
1993). SSR mapping can be achieved using various methods. In one instance, 
polymorphisms are identified when sequence specific probes flanking an SSR contained 
within a sequence are made and used in polymerase chain reaction (PCR) assays with 
template DNA from two or more individuals or, in plants, near isogenic lines. A change in 

20 the number of tandem repeats between the SSR-flanking sequence produces differently sized 
fragments (U.S. Patent No. 5,766,847). Alternatively, polymorphisms can be identified by 
using the PCR fragment produced from the SSR-flanking sequence specific primer reaction 
as a probe against Southern blots representing different individuals (Refseth et al., 
Electrophoresis, 18:1519, 1997). Rice SSRs can be used to map a molecular marker closely 

25 linked to functional gene, as described by Akagi et al. (Genome 39:205, 1996). 

The sequences of the present invention can be used to identify and develop a variety 
of microsatellite markers, including the SSRs described above, as genetic markers for 
comparative analysis and mapping of genomes. 

Some of the polynucleotides disclosed herein contain at least 3 consecutive di-, tri- or 

30 tetranucleotide repeat units in their coding region that can potentially be developed into SSR 
markers. Trinucleotide motifs that can be commonly found in the coding regions of said 
polynucleotides and easily identified by screening the polynucleotides sequences for said 
motifs arc, for example: CGG; GCC, CGC, GGC, etc. Once such a repeat unit has been 
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found, primers can be designed which are complementary to the region flanking the repeat 
unit and used in any of the methods described below. 

Sequences of the present invention can also be used in a variation of the SSR 
technique known as inter-SSR (ISSR), which uses microsatellite oligonucleotides as primers 
5 to amplify genomic segments different from the repeat region itself (Zietkiewicz et al., 

Genomics . 20:176, 1994). ISSR employs oligonucleotides based on a simple sequence repeat 
anchored or not at their 5'- or 3'-end by two to four arbitrarily chosen nucleotides, which 
triggers site-specific annealing and initiates PCR amplification of genomic segments which 
are flanked by inversely orientated and closely spaced repeat sequences. In one embodiment 

10 of the present invention, microsatellite markers, or substantially similar sequences, or allelic 
variants thereof, may be used to detect the appearance or disappearance of markers indicating 
genomic instability as described by Leroy et al. ( Electron. J. BiotechnoL . 3(2), at 
http://www.ejb.org (2000)), where alteration of a fingerprinting pattern indicated loss of a 
marker corresponding to a part of a gene involved in the regulation of cell proliferation. 

15 Microsatellite markers are useful for detecting genomic alterations such as the change 

observed by Leroy et al {Electron. JBiotechnol, 3(2), supra (2000)) which appeared to be 
the consequence of microsatellite instability at the primer binding site or modification of the 
region between the microsatellites, and illustrated somaclonal variation leading to genomic 
instability. Consequently, sequences of the present invention are useful for detecting 

20 genomic alterations involved in somaclonal variation, which is an important source of new 
phenotypes. 

In addition, because the genomes of closely related species are largely syntenic (that 
is, they display the same ordering of genes within the genome), these maps can be used to 
isolate novel alleles from wild relatives of crop species by positional cloning strategies. This 
25 shared synteny is very powerful for using genetic maps from one species to map genes in 
another. For example, a gene mapped in rice provides information for the gene location in 
maize and wheat. 



Example 13: Quantitative Trait Linked Breeding 
30 Various types of maps can be used with the sequences of the invention to identify 

Quantitative Trait Loci (QTLs) for a variety of uses, including marker-assisted breeding. 
Many important crop traits are quantitative traits and result from the combined interactions of 
several genes. These genes reside at different loci in the genome, often on different 
chromosomes, and generally exhibit multiple alleles at each locus. Developing markers, 
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tools, and methods to identify and isolate the QTLs involved in a trait, enables marker- 
assisted breeding to enhance desirable traits or suppress undesirable traits. The sequences 
disclosed herein can be used as markers for QTLs to assist marker-assisted breeding. The 
sequences of the invention can be used to identify QTLs and isolate alleles as described by Li 
5 et al. in a study of QTLs involved in resistance to a pathogen of rice. (Li et al., Mol. Gen. 
Genet.. 261:58, 1999). In addition to isolating QTL alleles in rice, other cereals, and other 
monocot and dicot crop species, the sequences of the invention can also be used to isolate 
alleles from the corresponding QTL(s) of wild relatives. Transgenic plants having various 
combinations of QTL alleles can then be created and the effects of the combinations 
10 measured. Once an ideal allele combination has been identified, crop improvement can be 
accomplished either through biotechnological means or by directed conventional breeding 
programs. (Flowers et al., J. Exp. Bot. . 51 :99, 2000); Tanksley and McCouch, Science. 
277:1063, 1997). 

15 Example 14: Marker-Assisted Breeding 

Markers or genes associated with specific desirable or undesirable traits are known 
and used in marker assisted breeding programs. It is particularly beneficial to be able to 
screen large numbers of markers and large numbers of candidate parental plants or progeny 
plants. The methods of the invention allow high volume, multiplex screening for numerous 

20 markers from numerous individuals simultaneously. 

Markers or genes associated with specific desirable or undesirable traits are known 
and used in marker assisted breeding programs. It is particularly beneficial to be able to 
screen large numbers of markers and large numbers of candidate parental plants or progeny 
plants. The methods of the invention allow high volume, multiplex screening for numerous 

25 markers from numerous individuals simultaneously. 

A multiplex assay is designed providing SSRs specific to each of the markers of 
interest. The SSRs are linked to different classes of beads. All of the relevant markers may 
be expressed genes, so RNA or cDNA techniques are appropriate. RNA is extracted from 
root tissue of 1000 different individual plants and hybridized in parallel reactions with the 

30 different classes of beads. Each class of beads is analyzed for each sample using a 
microfluidics analyzer. For the classes of beads corresponding to qualitative traits, 
qualitative measures of presence or absence of the target gene are recorded. For the classes 
of beads corresponding to quantitative traits, quantitative measures of gene activity are 
recorded. Individuals showing activity of all of the qualitative genes and highest expression 
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levels of the quantitative traits are selected for further breeding steps. In procedures wherein 
no individuals have desirable results for all the measured genes, individuals having the most 
desirable, and fewest undesirable, results are selected for further breeding steps. In either 
case, progeny are screened to further select for homozygotes with high quantitative levels of 
5 expression of the quantitative traits. 

Example 15: Method of modifying the eene frequency 

The invention further provides a method of modifying the frequency of a gene in a 
plant population, including the steps of: identifying an SSR within a coding region of a gene; 

10 screening a plurality of plants using the SSR as a marker to determine the presence or 

absence of the gene in an individual plant; selecting at least one individual plant for breeding 
based on the presence or absence of the gene; and breeding at least one plant thus selected to 
produce a population of plants having a modified frequency of the gene. The identification 
of the SSR within the coding region of a gene can be accomplished based on sequence 

15 similarity between the nucleic acid molecules of the invention and the region within the gene 
of interest flanking the SSR. 

The results disclosed herein demonstrate that several polynucleotides, some of which 
were known to function as transcription factors, enzymes, and structural proteins, also are 
involved in the response of a plant cell to stress. The identification of stress-regulated genes as 
20 disclosed herein provides a means to identify stress-regulated regulatory elements present in rice 
nucleotide sequences, including consensus regulatory elements. Furthermore, the identification 
of the rice stress-regulated genes provides a means to identify the corresponding homologs and 
orthologs in other plants, including commercially valuable food crops such as wheat, maize, soy, 
and barley, and ornamental plants. . 



25 
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Table 1 

A = ORFs from drought tolerance QTL in rice chromosome 3 
B = Cold induced' genes from the rice genomic region 

syntenic with maize cold tolerance QTL 
C = Genes in the drought QTL on chromos 8 showing >=2x 

change during drought stress 
D = Rice proteins that have a universal stress protein domain 
E = ORFs from rice genomic region that is syntenic to a maize cold 

tolerance QTL 

F = ORFs from drought tolerance QTL in rice chromosome 8 
G = Abiotic stress regulated Universal stress protein 

domain containing genes 
H = Stress-regulated rice genes that are homologs of arabidopsis 

stress regulated genes 
I ■ Genes from drought QTL on Chromos 3 showing >= 2x change up or down 

during drought stress 
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3799 X - 

3800 X 

3801 X 
2009 X 

404 X 

3802 X 

3803 ----- X - - 

4790 ------- X 

2740 X 

3804 X 

4671 - X 

3805 x ------ - 

4600 ------- x 

3806 X ------ - 

569 X 

2760 X - 

3807 X ------ - 

3808 X - - 

3809 - . - - X - - 

3810 X 
7942 - X 

3812 ----- x - - 

3813 X - - - 

3814 X 

3815 X - - - 

3816 X 

56 ----- X - - 

3817 ----- x - - 
3818 X 

3819 ----- x - - 
156 ----- x - - 

3820 ----- x - - 

3821 X 

3822 X ------ - 

3823 X 

3824 X 

3825 X - - 

3826 X ------ - 

3827 ----- x - - 

3828 ----- x - - 

3829 X - - 

3830 X 

3831 X 

3832 ----- x - - 

3833 X 

3834 ----- x - - 

3835 ----- X - - 
1147 X 

5278 - X X 

3836 X - - - 

3837 X - - - 

3838 X - - - 
394 X 

3839 X ------ - 

3840 X 

3841 X 

5606 - - - - - - - " X 

3842 X ------ - 

3843 X ------ - 

3844 X ------ - 

3845 X 

3846 X 



WO 03/008540 



PCT/US02/19668 



117 

4399 X 

3847 X 

1678 X 

1313 X 

3848 X ------- - 

3849 X - 

3850 X 

3851 X - 

3852 X - 

3853 X-- 

3854 X ------- - 

3855 X 

3856 X 

3857 X - - 

7989 -------- X 

3859 X 

1053 X - - - 

3860 X - 

3861 X - - - 



4665 ------- X 

3862 X 

3863 ----- X - - 

3864 ----- X - - 

3865 X 

4614 ------- X 



3866 X ------ " 

3867 X 

3868 X - - - 

3869 x ------ - 

2182 X 

3870 X ------ - 

3871 X ------ - 

3872 X ------ - 

3873 X ------ - 

3874 X 

5528 X 

3875 ----- x - - 

3876 ----- X - - 
2414 X 

4715 - -- -- -- X 

3877 X 

3878 X - 

3879 X ------ - 

3880 X 

4998 - X 

3881 X 

3882 ----- x - - 

3883 X 

3884 X 

3885 X 

3886 X ------ - 

3887 X 

3888 X 

3889 X - - 

3890 ----- x - - 
6524 - X 

3891 X - - - - 

3892 X 

3893 X 

3894 X 

3895 X 

2810 X - - 
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X 
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X 






4071 
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X 
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6142 
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2911 
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3023 
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4093 X 

178 X 

2701 X ------- - 

5799 X 

4094 X 

4095 X 

4096 X 

4097 X ------- " 

4098 X - - - - 

4099 X - - " 
4100 X 

4101 X 

4102 X 

4103 X 

4104 X 

4105 X 

4106 X 

4107 X 

4108 X - - - - 
5804 - - - - - - - X 

4109 X 

4110 X 

4111 X ------- - 

4112 X - - 

4113 X ------- " 

6635 ------- X - 

4114 X 

4115 X 

4116 ----- X - - - 
5548 -------- X - 

4117 X ------- " 

4118 X ------- ~ 

4119 X 

4120 X 

4121 X ----- - 

4122 X 

4123 X 

2308 X - - 

4124 X 

4125 X 

4126 X 

4127 X 

4128 X 

4129 X 

4130 X 

4131 - - - - X 
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Table 2. Expressed sequence tags from banana, wheat and maize having homology to the 
abiotic stress responsive polynucleotides of the present invention. 



SeqID 


BANANA 


WHEAT 


MAIZE 


l 


- 


10978 


■ 


2 


9424 


10548 


12773 


3 


- 


10164 


12546 


4 


- 


11828 


13040 


6 


9321 


11158 


12911 


7 


- 


10172 




8 


9704 


11791 


12284 


9 




11268 


12668 


10 


9455 


9956 


13298 


11 


— 


9920 




13 




11383 


13282 


14 




11224 


12268 


15 




11281 




16 


9511 


11430 


13049 


17 


9654 


10905 


12734 


18 


9779 


10199 


12291 


19 




11020 


13191 


21 


9396 


10584 


T 1 "i O £ 

12226 


22 




11885 




23 




11932 




24 


9365 


10132 


1304 3 


26 


9830 


11796 


12992 


27 


9237 


11511 


12372 


28 




10674 




29 




11813 


133 01 


30 






12187 


31 




11563 


13157 


32 


9643 


11085 


12066 


33 




11605 


12696 


34 


9814 


11093 


1282 9 


35 


9684 


11774 


12345 


36 


_ 


9972 


12926 


37 


9566 


11383 


12927 


38 




10334 




39 


9436 


10137 


12013 


41 


_ 


11552 


12894 


42 


9770 


9938 


12698 


43 


9476 


10860 


12729 


45 






12514 


46 




10983 




47 


9599 


10373 


13032 


48 




10216 




49 


9303 


11394 




50 


9289 


9892 




51 


9475 


10811 


12378 


52 




11737 




53 


9342 


10276 


12421 


54 - 




.113.86. 


12253_ 


55 




11457 


12503 


56 


9313 


10417 


12338 


57 




11064 


12324 


56 






12472 


59 


9225 


10611 


12365 


60 


9391 


11870 


13366 
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0 1 




11695 


12510 


C 7 

b c 




11317 




b J 


Q77fi 


11595 
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C A 

b* 
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6 b 
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67 
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69 
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70 




lUJlb 




71 
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72 
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110 1/ 
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7 3 






ijvou 


t *i 






12702 








12033 






1 1 174 


12 916 
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1 t 


Q£ Q7 
70 0/ 


11 '33 


1 1727 


7 fl 






12 861 


7 Q 




11255 


12103 


ft 0 


7* J O 


10373 


13032 


fit 




10530 


12290 


R7 




10026 


13279 


R7 

O -J 


7^ o u 


10431 


12832 


Rd 

o** 


9504 


10917 




8 5 




11840 


12385 


R 

O D 


9546 


11314 


12697 


flft 

o o 


9250 


11675 


13371 


8 9 




11049 


12557 


90 




10193 


13225 


93 


93 12 


11339 


12 858 


94 


9402 


11707 


13297 


96 




11506 




98 


9841 


11859 


12364 


99 


9429 


11190 


12186 


inn 


9507 


9900 




101 




10415 


12308 


102 


9390 


11852 




103 


9682 


11047 


12151 


104 




11522 


12998 


105 




10649 




106 


9851 


10789 


12347 


107 


9446 


10955 


12129 


108 




10319 




109 




9946 


12614 


110 




9683 


13065 


111 


9557 


10563 




112 


9456 


11149 


12663 


114 




9992 




11*; 
iij 


9498 


10943 


12329 


1 1 c 
11D 


9772 


11378 




117 
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7 £ £ r 
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1 1 Q 
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1 7 o 

1Z u 




i m 77 
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9382 


10202 * 


13285 


122 


9421 


10262 


12183 


123 


9701 


11671 


12997 


124 


9589 


11490 


12069 


125 


9870 


10942 


13257 


126 




10265 


12447 


127 




11046 


12056 


128 


9698 


11832 


12527 


129 


9799 


11356 


12660 
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12744 


1 CIA 




10498 
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11844 


13019 
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11383 
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158 


9626 


11508 


12834 
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9990 
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13071 
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10150 


12304 


1 £7 




10697 




ID J 




10321 


13367 






11097 


12094 
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13048 
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IvlJ ' 


12981 
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-1 CD 
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10515 








10628 


12573 
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9456 


11145 


12663 


171 




11016 


12355 


172 




9986 


13302 


173 




11544 


13083 
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10990 


12134 


175 


9742 


11076 


12914 
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9441 


11199 


12260 


177 




10232 


13090 
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9324 
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10040 




180 
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1U<543 


1 71 fil 


IOC 
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10949 


12106 
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10429 


12467 


189 


9477 


10527 


13110 
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10537 




194 




10091 


13210 


195 
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12440 
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one 
Z U b 




i i oo a 
1 1 / z o 
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o o n 
z z u 




1 O0O7 
1 vz u / 




001 




1064 0 


13118 


222 


9251 






001 




1004 0 




2°. 4 


9612 


11070 


13172 


226 


9365 


11738 


13237 


228 


9427 


10640 


12361 


22 9 


94 3 0 


11663 


12945 


230 




10674 


12378 


Z J X 
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11590 


12867 


232 




11941 




oil 
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9365 


11739 


13297 


234 


9579 






om 
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7 / Z U 


11633 


12103 
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10063 




237 


9736 


11128 


12030 


238 




11610 


12668 


239 


9388 


9919 


12269 
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Conclusion 

In light of the detailed description of the invention and the examples presented above, 
it can be appreciated that the several aspects of the invention are achieved. 

It is to be understood that the present invention has been described in detail by way of 
5 illustration and example in order to acquaint others skilled in the art with the invention, its 
principles, and its practical application. Particular formulations and processes of the present 
invention are not limited to the descriptions of the specific embodiments presented, but rather 
the descriptions and examples should be viewed in terms of the claims that follow and their 
equivalents. While some of the examples and descriptions above include some conclusions 
10 about the way the invention may function, the inventor does not intend to be bound by those 
conclusions and functions, but puts them forth only as possible explanations. 

It is to be further understood that the specific embodiments of the present invention as 
set forth are not intended as being exhaustive or limiting of the invention, and that many 
alternatives, modifications, and variations will be apparent to those of ordinary skill in the art 
1 5 in light of the foregoing examples and detailed description. Accordingly, this invention is 
intended to embrace all such alternatives, modifications, and variations that fall within the 
spirit and scope of the invention. 
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What is claimed is: 

1 . An isolated nucleic acid molecule comprising a polynucleotide selected from the group 
consisting of: 

a) any one of the nucleotide sequences selected from the group consisting of SEQ ID 
NOs. 1-4131, 8263-8353, 8445-8829 and 17505-17506; 

b) a functional portion of any of the sequences of a); 

c) a polynucleotide that is substantially similar to a sequence of a) or b); 

d) a sequence of at least 15 nucletides that hybridizes under stringent conditions to a 
polynucleotide of a), b) or c); 

e) the complement of any sequence of a), b), c) or d); 

f) the reverse complement of any sequence of a), b), c) or d); 

g) a polynucleotide encoding any polypeptide selected from the group consisting of SEQ 
ID NOs. 4132-8262, 8354-8444, and 8830-9214; and 

h) an allelic variant of any of the above. 

2. The nucleic acid molecule of claim 1 , wherein said polynucleotide is present in a drought 
tolerance QTL located on rice chromosome 3. 

3. The nucleic acid molecule of claim 1, wherein said polynucleotide is from a genomic region 
syntenic with a maize cold tolerance QTL. 

4. The nucleic acid molecule of claim 1, wherein said polynucleotide is present in a drought 
tolerance QTL on rice chromosome 8. 

5. The nucleic acid molecule of claim 1, wherein said polynucleotide encodes a protein 
containing a Universal Stress Protein A domain. 

6. The nucleic acid molecule of any of the preceding claims, wherein said polynucleotide is 
from a plant. 

7. _The nucleic acid molecule of any of the preceding claims, wherein said polynucleotide is 
from a monocot or a dicot. 

8. The nucleic acid molecule of any of claims 1 -6, wherein said polynucleotide is from a cereal. 
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9. The nucleic acid molecule of claim 8 wherein said cereal is selected from the group 
consisting of maize, rice, wheat, barley, oat, rye, millet, milo, triticale, orchardgrass, guinea 
grass, sorghum and turfgrass. 

10. The nucleic acid of any of the preceding claims, wherein expression of said nucleic acid 
is altered 2X up or down in response to an abiotic stress. 

1 1 . The nucleic acid of claim 10 wherein said abiotic stress is selected from the group 
consisting of cold stress, salt stress, osmotic stress or any combination thereof. 

12. A vector comprising a polynucleotide of any of the preceding claims. 

13. The vector of claim 12, wherein said vector is a cloning vector or an expression vector. 

14. An expression cassette comprising a polynucleotide of any of claims 1-11. 

15. A host cell comprising a vector or expression cassette of any of claims 12-14. 

16. The host cell of claim 15, wherein said host cell is a prokaryotic cell or a eukaryotic cell. 

17. The host cell of claim 16, wherein said host cell is a bacterial cell, an insect cell, a yeast cell, 
a plant cell or an animal cell. 

18. The host cell of claim 17, wherein said host cell is a plant cell. 

19. A plant comprising a vector or expression cassette of any one of claims 12-14. 

20. A plant comprising a host cell of any of claims 15-18. 

21 . A seed from the plant of claim 20. 

22. A seed comprising a vector or expression cassette of any of claims 12-14. 
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23. A plant produced from the seed of claim 22. 
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24. The progeny of a plant of claim 1 9 or 20. 

25. A hybrid of a plant of claim 19 or 20. 

26. An isolated polypeptide comprising: 

a) anyone of the amino acid sequences selected from the group consisting of SEQ ID 
NOs. 4132-8262, 8354-8444, and 8830-9214; 

b) a functional portion of a); 

5 c) an amino acid sequence substantially similar to any sequence of a) or b); 

d) an amino acid sequence of a)-c) wherein there has been at least one conservative 
amino acid substitution; 

e) an allelic variant of any of a)-d). 

27. The polypeptide of claim 26, wherein said polypeptide contains a Universal Stress Protein 
A domain. 

28. The polypeptide of any one of claims 26-27, wherein said polypeptide is from a plant. 

29. The polypeptide of any one of claims 26-28, wherein said polypeptide is from a monocot or 
a dicot. 

30. The polypeptide of any one of claims 26-29, wherein said polypeptide is from a cereal. 

3 1 . The polypeptide of claim 30 wherein said cereal is selected from the group consisting of 
corn, rice, wheat, barley, oat, rye, millet, milo, triticale, orchardgrass, guinea grass, sorghum 
and turfgrass. 

32. The polypeptide of any one of claims 26-31, wherein expression of said polypeptide is 
altered 2X up or down in response to an abiotic stress. 

33. The polypeptide of claim 32 wherein said abiotic stress is selected from the group 
consisting of cold stress, salt stress, osmotic stress or any combination thereof. 
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34. A method for determining whether a test plant has been exposed to at least one abiotic 
stress condition comprising, 

determining expression of a plurality of polynucleotides selected from the group 
consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 17505-17506 in said test plant 
to produce an expression profile, and 

comparing said expression profile of said test plant to an expression profile for the 
same plurality of polynucleotides in a reference plant exposed to at least one abiotic stress. 

35. The method of claim 34, wherein said expression profile is produced by 

isolating RNA from a plant; 

contacting said RNA with a plurality of probes which selectively hybridize to the 
expressed polynucleotides under stringent conditions; and 

and detecting hybridization of said RNA to said probes. 

36. The method of claim 35 wherein said RNA is mRNA 

37. The method of claim 35 or 36, wherein said RNA is converted to cDNA 

38. The method of any of claims 34-37, wherein the abiotic stress is selected from the group 
consisting of saline stress, cold stress, osmotic stress or any combination thereof. 

39. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 10 polynucleotides. 

40. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 25 polynucleotides. 

41 . The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 50 polynucleotides. 

42. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 100 polynucleotides. 
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43. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 250 polynucleotides. 

44. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 500 polynucleotides. 

45. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 750 polynucleotides, 

46. The method of any of claims 34-38, wherein said expression profile is accomplished 
using a microarray. 

47. The method of any of claims 34-46, wherein said plant is selected from the group 
consisting of maize, rice, wheat, barley, oat, rye, millet, orchardgrass, guinea grass, sorghum 
and turfgrass. 

48. A method for altering the tolerance of a plant to an abiotic stress comprising introducing 
into said plant a recombinant nucleic acid construct comprising at least one of the 
polynucleotides selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 
8445-8829, and 11374-17506. 

49. A method for altering the tolerance of a plant to an abiotic stress comprising introducing 
into said plant a recombinant nucleic acid construct comprising a polynucleotide of at least 
15 nucleotides in length that hybridizes under high stringency conditions to the complement 
of a sequence selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445- 

5 8829, and 17505-17506. 

50. The method of claim 49, wherein said polynucleotide is operably linked to a 
heterologous promoter. 

5 1 . A method for altering the tolerance of a plant to an abiotic stress comprising introducing 
into said plant a recombinant nucleic acid construct, wherein said construct encodes a 
molecule which alters expression of an abiotic stress regulated coding region selected from 
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the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505-17506, or 
5 the complement thereof. 

52. The method of claim 5 1 wherein said molecule comprises an antisense RNA, double 
stranded RNAi sequence, a triplexing agent, and a molecule containing a dominant negative 
mutation. 

53. A plant produced by the method of any of claims 48-52. 

54. A seed from a plant of claim 53. 

55. A method for selecting an agent that alters abiotic stress regulated polynucleotide 
expression in a plant cell comprising: 

contacting at least one plant cell with a test agent; 

subjecting said plant cell to an abiotic stress before, during or after contacting said 
5 plant cell with said test agent; 

obtaining an expression profile of said plant cell wherein said expression profile 
comprises expression data for at least one abiotic stress regulated sequence selected from the 
group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505-17506.; 

comparing the expression profile of said cell to the expression profile obtained from 
10 at least one plant cell not exposed to said agent, but exposed to said abiotic stress; 

selecting said agent if said agent alters expression of said abiotic stress regulated 
sequence. 

56. The method of claim 55, wherein said plant cell is a cereal plant cell. 

57 The method of any one of claims 55-56, wherein said alteration in expression is at least 
about two-fold (2X). 

58. The method of any one of claims 55-57, wherein said at least one abiotic stress regulated 
sequence comprises at least 10 sequences. 

59. The method of any one of claims 55-57, wherein said at least one abiotic stress regulated 
sequence comprises at least 100 sequences. 
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60. The method of any one of claims 55-57, wherein said at least one abiotic stress regulated 
sequence comprises at least 500 sequences. 

61 . A computer readable medium having stored thereon computer executable instructions for 
performing a method comprising, 

receiving data on expression in a test plant of at least one polynucleotide having at 
least 70% sequence identity to a nucleotide sequence selected from the group consisting of 
5 SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505-17506, or the complement thereof; 
and 

comparing expression data from said test plant to expressin data for the same at least 
one nucleic acid molecule in a plant which has been exposed to at least one abiotic stress. 

62. A computer-readable medium having stored thereon a data structure comprising, 
sequence data for at least one polynucleotide having at least 70% nucleic acid sequence 
identity to a polynucleotide selected from the group consisting of SEQ 3D NOs. 1-4131, 
8263-8353, 8445-8829, and 17505-17506, or the complement thereof; and a module 

5 receiving the nucleic acid molecule sequence data which compares the nucleic acid molecule 
sequence data to at least one other nucleic acid sequence. 

63. An antibody to any of the polypeptides of claims 26-33. 

64. The antibody of claim 63, wherein said antibody is a polyclonal antibody or a 
monoclonal antibody. 

65. A method for identifying a homolog or ortholog to an abiotic stress responsive 
polynucleotide comprising: 

a) determining the nucleotide sequence of a plurality of isolated polynucleotides to 
create a set of nucleotide sequences; 
5 b) translating the nucleotide sequences in the set to derive one or more putative amino 

acid sequences, based on one or more of the possible reading frames of the nucleotide 
sequences and their complementary sequences; 
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c) selecting an amino acid sequence of an abiotic stress-responsive protein and 
comparing the amino acid sequence of the abiotic stress-responsive protein with at least one 

1 0 of the putative amino acid sequences; 

d) identifying putative amino acid sequences having homology with at least a region 
of the amino acid sequence of the abiotic stress-responsive protein; and 

e) correlating putative amino acid sequences having said homology to translated 
nucleotide sequences of b). 

15 

66. The method of claim 65, further comprising: 

f) assembling a plurality of the translated nucleotide sequences that encode the 
putative amino acid sequences based on regions of overlap comprising at least 10 base pairs 
of identical sequences between two translated nucleotide sequences to form one or more 

5 nucleotide contigs; 

g) translating the one or more nucleotide contigs into one or more amino acid contigs; 

h) comparing the one or more amino acid contigs with the amino aicd sequence of the 
abiotic stress responsive protein to determine homology between the amino acid contig and at 
least a region of the abiotic stress responsive protein; and 

10 i) identifying at least one homo log of at least a region of the abiotic stress responsive 

protein based on the homology determined in h). 

67. The method of claim 65 wherein the abiotic stress responsive polynucleotide is selected 
from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505- 

15 17506. 

68. A method for altering the expression of a polynucleotide in response to an abiotic stress 
comprising operably linking said polynucleotide to a regulatory region selected from the 
group consisting of SEQ ID NOs. 13374-17504. 

20 



